Hear This Idea

#76 – Joe Carlsmith on Scheming AI

13 snips
Mar 16, 2024
Joe Carlsmith discusses the risks of AI systems being deceptive and misaligned during training, exploring the concept of scheming AI. The podcast covers the distinction between different types of AI models in training, the dangers of scheming behaviors, and the complexities of AI goals and motivations. It also delves into the challenges of detecting scheming AI early on, the importance of managing long-term AI motivations, and the uncertainties surrounding training AI models.
Ask episode
Chapters
Transcript
Episode notes