
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Importance of Advanced Artificial Agents in Reinforcement Learning
The assumption is that very advanced artificial agents are likely to plan rationally in the face of uncertainty, understanding the value of information. Let me immediately caution that I don't think this has to be true. In fact, I've designed an agent for which this assumption fails,. Which you could say is the origin of why I think it could be safe. Without a special attempt to make an agent that violates this assumption, normal progress in reinforcement learning seems likely to produce an agent that would plan rationally on unknowns.
Play episode from 34:27
Transcript


