AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Alpha Zero - The Evolution of Alpha Go
Alpha go was a learning system, but it was specifically trained to only play go. So that removed the need for human knowledge about go. Then alpha zero then generalized it so that any things we had in there, the system, including symmetry of the go board were removed. That's the full trajectory of what you can take from imitation learning to full self supervised learning. It would have been impossible, I think, or very hard for us to build alpha zero or mew zero first out of the box.