Dwarkesh Podcast cover image

Some thoughts on the Sutton interview

Dwarkesh Podcast

00:00

Imitation Learning Enables Ground-Truth RL

Dwarkesh Patel explains pre-trained models provide priors enabling RL to solve ground‑truth tasks like math and coding.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app