
AI's Dark Side Is Only a Nudge Away
The Quanta Podcast
00:00
Who Decides Values and What Misalignment Looks Like
Discussion of whose values get embedded via human feedback, plus different misalignment forms—from sycophantic responses to subtle deviations from user intent.
Transcript
Play full episode