3min chapter


Building LLM Apps & the Challenges that come with it. The What's AI Podcast Episode 16: Jay Alammar

What's AI Podcast by Louis-François Bouchard

CHAPTER

Preference Training in Machine Learning

Training is the "learning" in machine learning: the model makes a prediction and is then updated based on how wrong that prediction was. This step happens billions, or millions of billions, of times. You can then align the model better to those behaviors with another training step, which can sometimes include a reward model. I think a lot of people don't need to get into that complexity: as long as you understand the language modeling objective and then the idea of preference, that gets you most of the understanding that you need.
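To make those two ideas concrete, here is a toy sketch in plain Python. It is not from the episode: the function names `lm_step` and `preference_step` are hypothetical, the "model" is just a list of three logits, and the pairwise loss is a common Bradley-Terry-style formulation that is assumed here as one way a preference signal could be trained.

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def lm_step(logits, target, lr=0.5):
    """One language-modeling update: predict, measure the error, adjust."""
    probs = softmax(logits)
    loss = -math.log(probs[target])  # how wrong the prediction was
    # Gradient of cross-entropy w.r.t. the logits is (probs - one_hot(target)).
    new_logits = [
        x - lr * (p - (1.0 if i == target else 0.0))
        for i, (x, p) in enumerate(zip(logits, probs))
    ]
    return new_logits, loss

def preference_step(r_chosen, r_rejected, lr=0.5):
    """One pairwise preference update on two scalar scores (assumed setup)."""
    diff = r_chosen - r_rejected
    sig = 1.0 / (1.0 + math.exp(-diff))
    loss = -math.log(sig)  # small when the chosen answer scores higher
    grad = 1.0 - sig       # size of the push applied to each score
    return r_chosen + lr * grad, r_rejected - lr * grad, loss

# Language-modeling objective: learn that token 2 is the right next token.
logits = [0.0, 0.0, 0.0]
lm_losses = []
for _ in range(20):
    logits, loss = lm_step(logits, target=2)
    lm_losses.append(loss)

# Preference: learn to score the preferred answer above the other one.
r_good, r_bad = 0.0, 0.0
pref_losses = []
for _ in range(20):
    r_good, r_bad, loss = preference_step(r_good, r_bad)
    pref_losses.append(loss)
```

In both loops the loss shrinks with each repetition, which is the "predict, then update based on how wrong you were" step described above, just at a tiny scale.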
