Are We Misreading the AI Exponential? Julian Schrittwieser on Move 37 & Scaling RL (Anthropic)

The MAD Podcast with Matt Turck

RL Training Data: Quality, Quantity, and Stability

Julian explains that RL training data comes from the model itself, that data quality matters for stable RL training, and that planning-intensive generation yields high-quality training data.
