
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
How to Create a Helper Agent
"This isn't complicated game theory. This is just like someone offers you everything you want, and you get to pick yes or no," he says. "Those sorts of strategies where we try to use things that are a bit more advanced than us to oversee something much more advanced ... They seem to fail this way."
Segment starts at 01:14:27 in the episode.


