
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Understanding Model Training Dynamics
This chapter explores model training dynamics, focusing on how instruction data used during pre-training can improve a model's interactive capabilities. It also discusses the complexities of oversight in machine learning and the impact of data mixing on model performance.