Generally Intelligent cover image

Episode 28: Sergey Levine, UC Berkeley, on the bottlenecks to generalization in reinforcement learning, why simulation is doomed to succeed, and how to pick good research problems

Generally Intelligent

00:00

Large Scale Regression Learning (RL) in Atari Games

In RL, you need to do this kind of factual reasoning pretty implicitly. And so you need to represent these sub-optimal behaviors. But in a language model, you don't need to,. they're often quite bad at a counterfactual reasoning. So there's something interesting here.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app