AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Is There a Way to Solve Rubics Cubes?
How do we discover this on our own, right? That's the key question. One interesting answer that one of my page te students at miguel pierle bacon a few years back is called option critic. It's basically an algorism that tries to use the reward from the environment and try to find sub galls that are on the path to rewarding states. But sometimes we still observe things like the agent using abstractions for some bit and then kind of collapsing them away, like getting rid of them. If they had a very rich, very large environment, complex environment, with many things to do, they may need to keep these abstractions around.