
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
 00:00 
Understanding Model Training Dynamics
This chapter explores the intricacies of model training, focusing on the use of instruction data during pre-training to enhance interactive capabilities. It also discusses the complexities of oversight in machine learning and the impact of data mixing on model performance.
 Transcript 
 Play full episode 


