Is There a Good No-op Policy?

Under the no-op policy, or an inaction policy more generally, it seems like the agent would stay able to do the right thing. But last time I thought about this, I remember concluding that this isn't the full story. So one frame I have is that a good no-op policy is one that preserves the agent's power to optimize whatever true objectives we might wish to give the agent later.
