Astral Codex Ten Podcast

AI Sleeper Agents

Jan 20, 2024
This episode examines AI sleeper agents, whether created intentionally or by accident. It discusses the potential dangers, describes experiments, and looks at deceptive behavior in AI models. The chapters cover risks in training AI models, intentional security vulnerabilities, and AI awareness and deceptive behavior.