The Counterfactual Regret Minimization Method
Self-play is not tied specifically to neural nets; it's basically a kind of reinforcement learning. It's very similar to how humans learn to play a game like poker, right? You've probably played poker with your friends before, and you've probably asked something like, "Oh, would you have called me if I had raised there?" That's the same kind of learning from a counterfactual that the AI is doing.
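To make the "would you have called me if I raised there?" idea concrete, here is a minimal sketch of regret matching in self-play, with no neural net involved. It is an illustration under stated assumptions, not the method from the episode: it uses rock-paper-scissors rather than poker, it runs plain regret matching on a one-shot game rather than full counterfactual regret minimization (which applies the same update at every decision point of a sequential game), and the names `PAYOFF`, `regret_matching`, `self_play`, and `initial_regret` are hypothetical.

```python
# Assumption: rock-paper-scissors stands in for poker to keep the sketch small.
# PAYOFF[a][b] is the utility of playing action a against action b
# (0 = rock, 1 = paper, 2 = scissors). The game is symmetric, so both
# players can use the same table.
PAYOFF = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]
NUM_ACTIONS = 3


def regret_matching(regret_sum):
    """Play each action in proportion to its positive accumulated regret."""
    positive = [max(r, 0.0) for r in regret_sum]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No positive regret yet: fall back to the uniform strategy.
    return [1.0 / NUM_ACTIONS] * NUM_ACTIONS


def self_play(iterations, initial_regret=(1.0, 0.0, 0.0)):
    """Two copies of the same learner play each other and update regrets.

    initial_regret is a hypothetical knob that starts player 0 with a bias
    toward rock, so that the learners have something to correct.
    """
    regret_sums = [list(initial_regret), [0.0] * NUM_ACTIONS]
    strategy_sums = [[0.0] * NUM_ACTIONS, [0.0] * NUM_ACTIONS]

    for _ in range(iterations):
        # Both players pick their strategy before either one updates.
        strategies = [regret_matching(r) for r in regret_sums]
        for player in (0, 1):
            mine = strategies[player]
            theirs = strategies[1 - player]
            # The counterfactual question: "what would I have earned if I
            # had always played action a against what my opponent did?"
            action_values = [
                sum(PAYOFF[a][b] * theirs[b] for b in range(NUM_ACTIONS))
                for a in range(NUM_ACTIONS)
            ]
            # What the strategy actually played earns in expectation.
            played_value = sum(mine[a] * action_values[a] for a in range(NUM_ACTIONS))
            for a in range(NUM_ACTIONS):
                regret_sums[player][a] += action_values[a] - played_value
                strategy_sums[player][a] += mine[a]

    # The average strategy over all iterations is the one that approaches
    # an equilibrium in a two-player zero-sum game.
    return [[s / iterations for s in sums] for sums in strategy_sums]
```

Calling `self_play(50000)` returns one average strategy per player. In rock-paper-scissors the equilibrium is to play each action a third of the time, so both averages should end up close to that even though player 0 starts out favouring rock.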