
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
How to Operationalize Reward for an Agent
The paper is only talking about very advanced reinforcement learners. And then it goes into other artificial agents, but let's stick with reinforcement learners too. Success above a certain level in this environment would require hypothesizing what are the sorts of things I might need to do in this video game environment and testing those hypotheses for as much of the rest of the duration of the time you're in that video game as you can.
Play episode from 26:25
Transcript


