#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

The Alternative Framing of RL

It's obviously less intuitive to imagine that these agents really care about getting more reward, rather than just being selected to do the thing you want them to do. So I'm curious what your take is on this kind of alternative framing of RL, and whether it is in fact a mistake to just assume that advanced RL agents want reward.

Play episode from 01:23:28
