#80 – Stuart Russell on why our approach to AI is broken and how to fix it

80,000 Hours Podcast

Understanding Inverse Reinforcement Learning and Human Preferences

This chapter explores inverse reinforcement learning, focusing on how machines can infer human preferences from observed behavior. It contrasts this approach with traditional reinforcement learning and emphasizes how interaction with humans helps uncover their motivations and refine a machine's understanding of individual preferences.

