The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Understanding Model Training Dynamics

This chapter explores the intricacies of model training, focusing on the use of instruction data during pre-training to enhance interactive capabilities. It also discusses the complexities of oversight in machine learning and the impact of data mixing on model performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app