AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Are the Poker Bots Still Beating Experts?
The significance of search is something that I think has been under appreciated by the field. The first paper I wrote on Hanabi, we applied search instead of reinforcement learning. We just took a handcrafted heuristic bot that was like the baseline that everybody would beat. And we added planning like a really, really simple form of search. It's actually the dumbest possible search you can do. That got to super human performance in self play Hanabi.