Is There a Good No-Op Policy?
Under the no-op policy, or an inaction policy more generally, it seems like the agent would stay able to do the right thing. But the last time I thought about this, I remember concluding that this isn't the full story. And so one frame I have is that a good no-op policy is one that is going to preserve the agent's power to optimize some true objectives we might wish to give the agent later.