5min chapter

#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)

Machine Learning Street Talk (MLST)

CHAPTER

The Limits of Reinforcement Learning

The data that we use to train them is very truncated in terms of the number of environments you can generate, which by extension means that any intelligence derivable from those systems is limited. But I think what's interesting is that what we're finding is that a lot of these really impressive abilities that are being exhibited by the large language models can actually be classified as a form of emergent behavior. So yeah, so many interesting things there. Later on, we'll go deeper into the point of people think reinforcement learning is quite an open-ended process. The only fly in the ointment, and we'll talk about intelligence properly in a minute, but Shane Leg does have a definition

00:00

Transcript

Episode notes

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

5min chapter

#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)

Machine Learning Street Talk (MLST)

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights