
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Importance of Intervening in Computer Programming
Hacking is easier than taking over the world, but I'm saying it has to take over the world too. So before it does anything with recognizable effects, I guess we don't need to worry so much about the things that happened before there were recognizable effects because they're not so strategically relevant. If it just hacked into it's into the computer running it and changed some lines of code, it would run a real risk of being shut down. And so that would be a poor strategy for getting maximal reward forever.
Play episode from 57:50
Transcript


