AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Do I Make Decisions Given These World Models and Loss Functions?
The way you make decisions is by applying counter-offentials corresponding to different policies. We don't take responsibility about the output of my own computation if it's not continuous like I wake up with certain memories but there are things that actually happened or maybe just someone simulating you already having those memories. Because you don't know where these things actually happened you cannot really update them and you cannot really learn anything with certainty. If your engine has some external computer that it can use to do computational experiments then run all sorts of programs and see what happens. You also need a corresponding guarantee that your computer is actually working correctly right? There's an important part where you're doing this kind of Turing reinforcement