TBPN cover image

Ilya Sutskever on Dwarkesh Patel Reaction, NVIDIA’s Response to Google’s AI Progress, Trump Unveils Genesis | Diet TBPN

TBPN

00:00

Pretraining, RL and data selection pitfalls

Hosts contrast pretraining with reinforcement learning and warn about reward hacking and data biases.

Play episode from 05:32
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app