Hear This Idea cover image

#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

00:00

How to Create a Helper Agent

"This isn't complicated game theory. This is just like someone offers you everything you want, and you get to pick yes or no," he says. "Those sorts of strategies where we try to use things that are a bit more advanced than us to oversee something much more advanced ... They seem to fail this way."

Play episode from 01:14:27
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app