The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

The Evolution of Reasoning in Small Language Models with Yejin Choi - #761

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Imitation learning as supervised fine-tuning

Yejin clarifies that imitation learning in this context refers to supervised fine-tuning using high-quality trajectories.

Play episode from 17:12
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app