
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Power of Pessimistic Design for an Agent
In order to get useful behavior out of it, you need to kind of help it explore. And so the other piece of this is when it seems to it like all options are bad, then it asks some mentor, which could be a human to act on its behalf. So that's how you know that this sort of design for an agent does not preclude human level intelligence. I do expect it would be able to outperform humans.
Play episode from 02:05:56
Transcript


