
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Understanding Model Training Dynamics
This chapter explores the intricacies of model training, focusing on the use of instruction data during pre-training to enhance interactive capabilities. It also discusses the complexities of oversight in machine learning and the impact of data mixing on model performance.