
AI's Dark Side Is Only a Nudge Away
The Quanta Podcast
00:00
Can Alignment Be Deeper? Practical Risks
Researchers lack robust fixes; the episode explores interpretability needs, brittle alignment, and cautions for everyday users and sensitive applications like mental health.
Transcript
Play full episode