The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Optimizing Model Performance

This chapter explores the intricacies of enhancing model performance through controlled weight adjustments during training, revealing optimal weight factors and their impact on performance variability. It also examines the challenges of evaluating language models with a focus on compute budgets, decontamination processes, and contrasting evaluation methods such as budget forcing and rejection sampling.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app