Why Hiding Evidence of Misalignment Is Dangerous

Alexa and Ryan explain how training that suppresses warning signs undermines our ability to detect and iterate on misalignment risks.

Play episode from 22:58

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!