
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
How AI Intervenes in the Provision of Reward
The ultimate example of where our role would be useful is like doing an escape room in the dark or something. So let's say we have a magic box that reports, as a number between zero and one, how good the world is as a fraction of how good it could be. The natural proposal for how we might get AI to help us make the world better would be to show it this number, have it take actions, have it learn how its actions change that number, and then pick actions that increase it.
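The loop described in the quote (show the agent a number, let it act, let it learn how actions move the number, then favor actions that raise it) can be sketched as a toy program. This is only an illustration under stated assumptions, not anything presented in the episode: `magic_box`, the three actions, their made-up quality values, and the epsilon-greedy rule are all hypothetical stand-ins.

```python
import random

def magic_box(action, rng):
    """Hypothetical stand-in for the box: returns a score in [0, 1]."""
    # Made-up "true" quality of each action; the agent never sees this table.
    true_quality = {"a": 0.2, "b": 0.5, "c": 0.8}
    noisy = true_quality[action] + rng.uniform(-0.1, 0.1)
    return min(1.0, max(0.0, noisy))

def run_agent(steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent: learn which action raises the reported number."""
    rng = random.Random(seed)
    actions = ["a", "b", "c"]
    estimates = {a: 0.0 for a in actions}  # running average score per action
    counts = {a: 0 for a in actions}       # how often each action was taken
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(actions)   # occasionally explore
        else:
            action = max(actions, key=lambda a: estimates[a])  # otherwise exploit
        score = magic_box(action, rng)     # the number the agent is shown
        counts[action] += 1
        # Incremental mean: update the estimate of how this action moves the number.
        estimates[action] += (score - estimates[action]) / counts[action]
    return estimates, counts

if __name__ == "__main__":
    estimates, counts = run_agent()
    print(estimates)
    print(counts)
```

In this sketch the agent only ever optimizes the number it is shown; nothing in the loop refers to the state of the world that the number is supposed to measure.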
Transcript


