The Top 3 Properties of a Prism

We give the agent a little bit of information, and we penalize the agent compared to inaction. And what we're saying is, well, inaction would be good for preserving your ability to do the right thing. Another nice thing is that follow-up work demonstrated that you don't need that many auxiliary goals to get a good penalty term. So I don't think we'll talk about it as much, but I would say that those are the top three.
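As a rough illustration of the idea of penalizing the agent compared to inaction, here is a minimal sketch. It assumes the penalty is the average absolute change in value across a handful of auxiliary goals, measured against a do-nothing action. The names `inaction_penalty`, `shaped_reward`, `aux_q_values`, and `weight` are hypothetical and do not come from the episode, and the exact form used in the work being discussed may differ.

```python
def inaction_penalty(aux_q_values, action, noop):
    """Average absolute change in auxiliary value versus doing nothing.

    aux_q_values: one dict per auxiliary goal, mapping action -> estimated value
    (an assumed representation, chosen only for this sketch).
    """
    diffs = [abs(q[action] - q[noop]) for q in aux_q_values]
    return sum(diffs) / len(diffs)


def shaped_reward(task_reward, aux_q_values, action, noop, weight=1.0):
    """Task reward minus the weighted penalty (weight is an assumed knob)."""
    return task_reward - weight * inaction_penalty(aux_q_values, action, noop)


# Two made-up auxiliary goals with made-up value estimates.
aux_q_values = [
    {"noop": 1.0, "push": 0.5},  # "push" reduces the ability to reach goal 1
    {"noop": 2.0, "push": 2.0},  # "push" leaves goal 2 unaffected
]

penalty_push = inaction_penalty(aux_q_values, "push", "noop")
penalty_noop = inaction_penalty(aux_q_values, "noop", "noop")
reward_push = shaped_reward(1.0, aux_q_values, "push", "noop", weight=2.0)
```

Doing nothing is never penalized under this sketch, since it is compared against itself, and only a small list of auxiliary goals is needed to compute the term.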
