What's AI Podcast by Louis-François Bouchard cover image

Building LLM Apps & the Challenges that come with it. The What's AI Podcast Episode 16: Jay Alammar

What's AI Podcast by Louis-François Bouchard

00:00

Preference Training in Machine Learning

The training, this is the learning in machine learning of making a prediction and updating the model based on how wrong that prediction was. This step is what happens billions of, or millions of billions of times. And then you can get a little bit more, you can align the model better to those behaviors by having another sort of training step which sometimes can include re-en re-reward model. That complexity, I think a lot of people don't need to get it, like as long as you understand the language modeling objective and then this, the idea of preference, that gets you most of the understanding that you need.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app