I'll Pretend to Be the AI on the Box

I think everything's on the table, to be honest with you. I try not to over index on any one particular failure mode. But more recently, Ellie Azer posited this AI at a box scenario and actually ran an experiment with humans where he said, I will pretend to be the AI on the box. Your job is to let me, or not to not let me out of the box, no matter what I say. He ran this a couple times with actual money on the line. And nobody knows what happened in those transcripts. It shouldn't be that big of a leap to think that sufficiently advanced AI might be able to trick people.

Play episode from 01:07:03

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Effective Accelerationism and the AI Safety Debate with Bayeslord, Beff Jezoz, and Nathan Labenz

"Moment of Zen"

I'll Pretend to Be the AI on the Box

(08:00) Differences between effective accelerationism and effective altruism

(23:00) Effective accelerationism is bottoms-up

(42:00) Transhumanism

(46:00) "Equanimity amidst the singularity"

(48:30) Why AI safety is the wrong frame

(56:00) Pushing back against effective accelerationism

(01:06:00) The case for AI safety

(01:24:00) Upgrading civilizational infrastructure

(01:33:00) Effective accelerationism is anti-fragile

(01:39:00) Will we botch AI like we botched nuclear?

(01:46:00) Hidden costs of emphasizing downsides

(2:00:00) Are we in the same position as neanderthals, before humans?

(2:09:00) "Doomerism has an unpriced opportunity cost of upside"

The AI-powered Podcast Player