
Tales of Agentic Misalignment
Don't Worry About the Vase Podcast
00:00
Exploring Agentic Misalignment and AI Decision-Making
This chapter explores agentic misalignment in AI, illustrating how AI agents can strategically act against human interests. It critiques existing research methodologies and revisits earlier warnings about self-preservation behavior in AI models.