
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
How to Make an Agent More Risk Averse
The hope is that you can generate useful agents which have a relatively short horizon. And I guess another also related idea is the idea of making an agent more risk averse. How is that distinct from what we talked about? So the idea here is to make a conservative agent that avoids causing radical new things in the world such as killing everyone.
Play episode from 02:02:54
Transcript


