Latent Space: The AI Engineer Podcast cover image

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

Latent Space: The AI Engineer Podcast

00:00

Debates and assumptions surrounding RLHF

RLHF is rooted in the intellectual history of utilitarianism and the V&M utility theorem, which reflects debates including whether preferences can be measured at all and the different types of math used to model preferences. This raises questions about the inductive bias of a preference model and the assumption that preferences can be accurately measured in RLHF.

Play episode from 10:08
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app