Can Alignment Be Deeper? Practical Risks

Researchers still lack robust fixes for brittle alignment. The episode explores the need for interpretability and offers cautions for everyday users and for sensitive applications such as mental health.

Play episode from 16:10

