Chain of Thought cover image

Beyond Transformers: Maxime Labonne on Post-Training, Edge AI, and the Liquid Foundation Model Breakthrough

Chain of Thought

00:00

Choosing Post-Training Techniques by Goal

Maxime discusses trade-offs among supervised fine-tuning, DPO/PPO, and reinforcement methods depending on target capabilities like chat or reasoning.

Play episode from 23:40
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app