
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
The Cost of Experimentation Is Relatively Small
In my model, it would really have to take over the world. I guess you could also just try to make your wires tamper-proof and put up a Faraday cage for rewards. In that case, a button's not going to get in the way. You know, the point of the button wasn't to make it harder. It was just to kind of obfuscate the process itself. That's why I think assumption four is very likely to hold.
Play episode from 50:13


