Microsoft Research Podcast cover image

AI Frontiers: The future of scale with Ahmed Awadallah and Ashley Llorens

Microsoft Research Podcast

00:00

Exploring the Concept of Post-Training in Language Model Training

This chapter explores the concept of post-training in language model training and how it helps align models with human values. It covers reinforcement learning from human feedback and the steps involved in teaching the model to respond in a helpful and harmless manner.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app