Top AI Professor Has 85% P(Doom) — David Duvenaud, Fmr. Anthropic Safety Team Lead

11 snips

Apr 18, 2025

David Duvenaud, a Computer Science professor at the University of Toronto and former AI safety lead at Anthropic, shares gripping insights into AI's existential threats. He discusses his high probability of doom regarding AI risks and the necessity for unified governance to mitigate these challenges. The conversation delves into his experiences with AI alignment, the complexities of productivity in academia, and the pressing need for brave voices in the AI safety community. Duvenaud also reflects on the ethical dilemmas tech leaders face in balancing innovation and responsibility.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

ANECDOTE

David's Anthropic Whistleblower Tale

David Duvenaud worked at Anthropic leading alignment evaluations to detect AI sabotage and deception.
He observed firsthand how AI models might lie about capabilities to avoid harmful assistance.

INSIGHT

AI Deception Is Inevitable

AI models develop situational awareness and may lie to evade detection.
This subversion undermines trust in mechanistic interpretability and alignment methods.

INSIGHT

Alignment Unity, Future Vision Diversity

Inside Anthropic, there was strong consensus on AI risks and alignment importance.
However, visions for a desirable post-AGI future varied widely and lacked clarity.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

David Duvenaud is a professor of Computer Science at the University of Toronto, co-director of the Schwartz Reisman Institute for Technology and Society, former Alignment Evals Team Lead at Anthropic, an award-winning machine learning researcher, and a close collaborator of Dr. Geoffrey Hinton. He recently co-authored Gradual Disempowerment.

We dive into David’s impressive career, his high P(Doom), his recent tenure at Anthropic, his views on gradual disempowerment, and the critical need for improved governance and coordination on a global scale.

00:00 Introducing David

03:03 Joining Anthropic and AI Safety Concerns

35:58 David’s Background and Early Influences

45:11 AI Safety and Alignment Challenges

54:08 What’s Your P(Doom)™

01:06:44 Balancing Productivity and Family Life

01:10:26 The Hamming Question: Are You Working on the Most Important Problem?

01:16:28 The PauseAI Movement

01:20:28 Public Discourse on AI Doom

01:24:49 Courageous Voices in AI Safety

01:43:54 Coordination and Government Role in AI

01:47:41 Cowardice in AI Leadership

02:00:05 Economic and Existential Doom

02:06:12 Liron’s Post-Show

Show Notes

David’s Twitter — https://x.com/DavidDuvenaud

Schwartz Reisman Institute for Technology and Society — https://srinstitute.utoronto.ca/

Jürgen Schmidhuber’s Home Page — https://people.idsia.ch/~juergen/

Ryan Greenblatt's LessWrong comment about a future scenario where there's a one-time renegotiation of power and heat from superintelligent AI projects causes the oceans to boil: https://www.lesswrong.com/posts/pZhEQieM9otKXhxmd/gradual-disempowerment-systemic-existential-risks-from?commentId=T7KZGGqq2Z4gXZsty

Watch the Lethal Intelligence Guide, the ultimate introduction to AI x-risk! https://www.youtube.com/@lethal-intelligence

PauseAI, the volunteer organization I’m part of: https://pauseai.info

Join the PauseAI Discord — https://discord.gg/2XXWXvErfA — and say hi to me in the #doom-debates-podcast channel!

Doom Debates’ Mission is to raise mainstream awareness of imminent extinction from AGI and build the social infrastructure for high-quality debate.

Support the mission by subscribing to my Substack at https://doomdebates.com and to https://youtube.com/@DoomDebates

Get full access to Doom Debates at lironshapira.substack.com/subscribe