Hear This Idea cover image

#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

00:00

How to Operationalize Reward for an Agent

The paper is only talking about very advanced reinforcement learners. And then it goes into other artificial agents, but let's stick with reinforcement learners too. Success above a certain level in this environment would require hypothesizing what are the sorts of things I might need to do in this video game environment and testing those hypotheses for as much of the rest of the duration of the time you're in that video game as you can.

Play episode from 26:25
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app