
AGI Can Be Safe
Data Skeptic
00:00
The Stop Button Problem and the Correct Ability Problem
The stop button problem or correctability problem is a major area of study in AI safety./nThis problem deals with making an AI fully obedient, but ensuring that the AI obeys commands that align with human values./nObedience to a single, long-lasting command can lead to unforeseen consequences, as seen in the paperclip optimizer thought experiment./nMathematical research is being conducted to determine what kind of obedience is needed for an AI to make safe decisions that align with human values.
Transcript
Play full episode