AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Top 3 Properties of a Prism
We give the agent a little bit of information, and that we penalize the agent compared to in action. And what we're saying is, well, inaction would be good for preserving your ability o do the right thing. Another nice thing is a fall op work demonstrated that you don't need that many auxiliary goals to to get a good penalty term. So i don't think we'll talk about it as much but i would say that those are the top three.