Latent Space: The AI Engineer Podcast cover image

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

Latent Space: The AI Engineer Podcast

00:00

Modeling Pairwise Preferences and Aggregation

The pairwise preference approach, specifically the Bradley Terry model from the 50s, gained popularity as other methods failed. It heavily relies on the aggregation of preferences, which is not always accurate due to individual differences. This approach aims to model preferences based on correctness and style rather than controversial or meaningful notions of preference.

Play episode from 33:21
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app