What's AI Podcast by Louis-François Bouchard cover image

Building LLM Apps & the Challenges that come with it. The What's AI Podcast Episode 16: Jay Alammar

What's AI Podcast by Louis-François Bouchard

CHAPTER

Preference Training in Machine Learning

The training, this is the learning in machine learning of making a prediction and updating the model based on how wrong that prediction was. This step is what happens billions of, or millions of billions of times. And then you can get a little bit more, you can align the model better to those behaviors by having another sort of training step which sometimes can include re-en re-reward model. That complexity, I think a lot of people don't need to get it, like as long as you understand the language modeling objective and then this, the idea of preference, that gets you most of the understanding that you need.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner