
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Limits of Advanced RL Agents
It could be that this sort of sufficient advancement is quite a bit more advanced than than we are. Maybe I don't I doubt that those arguments would go through. If you don't take over the world, you get shut off. If you do anything that looks bad, and so it would seem to be a poor choice of actions for long term or maximization. Okay, well, here's another, I guess, relevant question. Let's just see what happens when we like relax some of the assumptions or imagine like kind of somewhat but not wildly advanced agents do similar problems apply? Well, it seems it seems plausible. Like maybe it's not taking over the world,. But maybe it
Play episode from 01:08:06
Transcript


