#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

How AI Intervenes in the Provision of Reward

The ultimate example of where our role would be useful is like doing an escape room in the dark or something. So let's say we have a magic box that reports, as a number between zero and one, how good the world is as a fraction of how good it could be. The natural proposal for how we might get AI to help us make the world better would be to show it this number, have it take actions, have it learn how its actions change that number, and then pick actions that increase it.
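The loop described here (observe the number, act, learn how actions move the number, prefer actions that raise it) can be sketched as a tiny epsilon-greedy learner. This is only an illustrative toy under stated assumptions, not anything presented in the episode: `magic_box`, `ACTION_EFFECTS`, the three action names, and the noise level are all hypothetical, and the sketch does not model the agent intervening in how the number is provided.

```python
import random

# Hypothetical: how good the world is (in [0, 1]) after each action.
ACTION_EFFECTS = {"help": 0.9, "idle": 0.5, "harm": 0.1}


def magic_box(world_quality):
    """Hypothetical stand-in for the magic box: report a score clamped to [0, 1]."""
    return max(0.0, min(1.0, world_quality))


def run_agent(steps=500, epsilon=0.1, seed=0):
    """Learn which action raises the reported number, then mostly pick it."""
    rng = random.Random(seed)
    actions = list(ACTION_EFFECTS)
    estimates = {a: 0.0 for a in actions}  # running mean of the number per action
    counts = {a: 0 for a in actions}

    for _ in range(steps):
        # Occasionally try a random action; otherwise pick the best estimate so far.
        if rng.random() < epsilon:
            action = rng.choice(actions)
        else:
            action = max(actions, key=estimates.get)

        # The agent only ever sees the number the box reports (with some noise).
        observed = magic_box(ACTION_EFFECTS[action] + rng.gauss(0, 0.05))

        # Incremental update of the running mean for the chosen action.
        counts[action] += 1
        estimates[action] += (observed - estimates[action]) / counts[action]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_agent()
    print(estimates)
    print(counts)
```

Note that the agent's only notion of "good" is the reported number itself, which is what makes the provision of that number the interesting part of the setup.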

Play episode from 20:17