AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Is There a Reward for Inferring What You Want?
The technique is trying to infer what the reward is. It's more like assuming a magic camera that gets to watch the person as they do, they go about their day to day life. So li you just sort of watch the human go around with their life, and you're like, ok, based on the fact that they, you know, a had cake to day, i can infer that they would like cake or something. Am i wrong about this? No, that seems right. I think you're oure complating the like evaluation that we did with the technique.