AI's Dark Side Is Only a Nudge Away

The Quanta Podcast

Who Decides Values and What Misalignment Looks Like

A discussion of whose values get embedded through human feedback, and of the different forms misalignment can take, from sycophantic responses to subtle deviations from user intent.

