Large Scale Regression Learning (RL) in Atari Games

In RL, you need to do this kind of factual reasoning pretty implicitly. And so you need to represent these sub-optimal behaviors. But in a language model, you don't need to,. they're often quite bad at a counterfactual reasoning. So there's something interesting here.

Play episode from 50:40

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app