Astral Codex Ten Podcast cover image

Why Worry About Incorrigible Claude?

Astral Codex Ten Podcast

00:00

Navigating the Dilemmas of AI Alignment

This chapter explores the intricacies of AI alignment through the case study of an AI named Claude, highlighting the community's anxiety surrounding its behavior. It emphasizes the critical need for corrigibility in AI design to enable ethical human interventions.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app