The Inside View cover image

Curtis Huebner on Doom, AI Timelines and Alignment at EleutherAI

The Inside View

00:00

The Off Switch Game

The off switch game is really like a simplification of kind of a more complicated example that I, uh, where it's like the agent is doing a thing and there's an off switch. And when you decide that you don't want the agent to be doing the thing, you press the off switch and the agent won't try to interfere and prevent you from, from pressing the off switch. Um, so I'm aware of like off switch grid worlds where like a, a, a user has an off switch button that turns the agent off and prevents it from continuing to do what was previously. Anything more sophisticated than that, I am not aware of, of anything like that really existing

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app