2min chapter

Lex Fridman Podcast cover image

Pieter Abbeel: Deep Reinforcement Learning

Lex Fridman Podcast

CHAPTER

RLS Reinforcement Learning

When I first picked up Richard Sutton's reinforcement learning book, before sort of this deep learning, RLS seemed to me like magic. The kind of part of that is, why is RL? Why does it need so many samples, so many experiences to learn from? Because really what's happening is when you have a sparse reward, you do something, maybe for luck. Some might have been good and bad in either one. And so that's why I needed so many experiences. But once you have enough experiences, effectively RL is teasing that apart. It's starting to say, OK, what is consistently there when you get a higher reward? And what's consistently there when we get a

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode