AI Engineering 101 with Chip Huyen (Nvidia, Stanford, Netflix)

Lenny's Podcast: Product | Career | Growth

How reinforcement learning from human feedback (RLHF) works

Lenny prompts a discussion of RLHF; Chip outlines human comparisons, reward models, and using AI or verifiable signals for rewards.
