
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Visualizing Recurrent States in Machine Learning
This chapter explores the evolution of random states in machine learning models as they predict tokens, illustrating the unexpected behaviors that emerge during this process. It also examines the relationship between recurrent and diffusion models, highlighting both their conceptual similarities and distinct training challenges in language processing.
Transcript
Play full episode