3min chapter

The Thesis Review cover image

[11] Jacob Andreas - Learning from Language

The Thesis Review

CHAPTER

Reinforcement Learning and the Language Bottleneck

The language bottleneck can help potentially like reduce these spurious correlations that the model picks up on exactly as well as well and then that would help with generalization yeah. In terms of policy learning and reinforcement learning you did apply this ideaYeah I mean honestly the reinforcement learning experiments in that paper are kind of a gross hack and I look forward to being able to point to something better to replace them with but the story there in reinforcement learning right is that it's not like the form of the experience that you get is here's an example of the agent going to its goal figure out what it was trying to do and then do it again because if that were the case you could just memorize the policy and

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode