The Stop Button Problem and the Correct Ability Problem

The problem of making an AI or Bayou when you wanted to obey you is called the stop button problem. It's more about an AI which will obey your command, but it's still burned by the laws of physics. In taught experiments, and you can also validate these in small toy world simulations, if you say, okay, please obey to my future self, who knows better? That actually would give the AI an incentive to manipulate your future self.

Play episode from 03:17

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app