The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

Understanding Model Training Dynamics

This chapter explores the intricacies of model training, focusing on the use of instruction data during pre-training to enhance interactive capabilities. It also discusses the complexities of oversight in machine learning and the impact of data mixing on model performance.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner