"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Full-Stack AI Safety: Why Defense-in-Depth Might Work, with Far.AI CEO Adam Gleave

95 snips
Sep 20, 2025
Adam Gleave, co-founder and CEO of FAR AI, discusses his organization's vital work in AI safety. He shares insights on the 'defense-in-depth' strategy to navigate potential risks from advanced AI systems. Gleave explores the future landscape post-AGI, emphasizing the complexities of achieving full autonomy. He highlights innovative approaches like using 'lie detectors' for AI deception detection and the importance of interpretability in AI planning. His cautious optimism underscores that meticulous planning and design can significantly enhance AI safety.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Plausible Post-AGI Life

  • Adam Gleave envisions a plausible post-AGI world where humans are materially comfortable but not fully in control.
  • He likens it to being a moderately privileged historical class with high living standards but limited agency.
INSIGHT

Three-Tier Capability Framework

  • Gleave defines three capability tiers: powerful tool AIs, agentic attackers, and fully autonomous AI organizations.
  • He assigns timelines: now for tools, ~1–7 years for agentic misuse, and a median ~14 years for full AI organizations.
INSIGHT

Spiky Skills Slow Full Automation

  • Gleave expects AI skill-profiles to be 'spiky' so humans retain advantages in long-horizon, sample-efficient tasks.
  • This spikiness delays fully AI-run organizations until architectures or memory efficiency improve.
Get the Snipd Podcast app to discover more snips from this episode
Get the app