Astral Codex Ten Podcast

God Help Us, Let's Try To Understand AI Monosemanticity

Dec 1, 2023
This podcast explores the challenges of understanding AI monosemanticity and the limitations of looking inside an AI. It delves into the complexities of AI neurons, superposition, and conceptual representations. Additionally, it discusses the surprising claim made in a non-peer-reviewed paper and promotes the '80,000 Hours Podcast' on pressing issues like AI safety.
Ask episode
Chapters
Transcript
Episode notes