How to Train a Student Agent to Navigate Mazes?
We started with grid worlds, and we trained agents to basically generate mazes. And then we take the student out of that kind of space and actually present it with handcrafted, quite tricky mazes. We see strong zero-shot generalization to these kinds of held-out problems. Then we thought, OK, if that works for mazes, maybe we can also do this in a continuous control environment. So we actually moved over to car racing, and we let the teacher basically generate Formula One tracks over time. I mean, not actual Formula One tracks, but basically race tracks. And we have the student trying to get through these as quickly as possible.