How to Modify a Value Function in a Poker Bot?

ReBeL's search algorithm can handle these high-dimensional, continuous state and action spaces. It uses a neural network value function that takes as input the belief distribution over which cards each player holds. This had actually been done before: a 2017 paper from the University of Alberta, called DeepStack, first developed this technique.
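To make the idea of a value function over beliefs concrete, here is a minimal sketch in plain Python. It is not the actual ReBeL or DeepStack architecture. The toy sizes (`NUM_HANDS`, `PUBLIC_FEATURES`), the helper names, and the choice to output one value per player per hand are all assumptions made for illustration, and the network has random, untrained weights.

```python
import random

# Hypothetical toy sizes, chosen only for illustration.
NUM_PLAYERS = 2
NUM_HANDS = 6        # number of possible private hands per player
PUBLIC_FEATURES = 3  # e.g. pot size and a board encoding (assumed)


def normalize(weights):
    """Turn non-negative weights into a probability distribution."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("belief weights must have a positive sum")
    return [w / total for w in weights]


def encode_belief_state(public_features, beliefs):
    """Concatenate public features with each player's belief over hands."""
    x = list(public_features)
    for belief in beliefs:
        x.extend(normalize(belief))
    return x


class TinyValueNet:
    """One-hidden-layer MLP with random weights (untrained, assumed shape)."""

    def __init__(self, in_dim, hidden_dim, out_dim, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.gauss(0, 0.1) for _ in range(in_dim)]
                   for _ in range(hidden_dim)]
        self.b1 = [0.0] * hidden_dim
        self.w2 = [[rng.gauss(0, 0.1) for _ in range(hidden_dim)]
                   for _ in range(out_dim)]
        self.b2 = [0.0] * out_dim

    def forward(self, x):
        # Hidden layer with ReLU activation.
        h = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        # Linear output: one value per (player, hand) pair (an assumption).
        return [sum(w * hi for w, hi in zip(row, h)) + b
                for row, b in zip(self.w2, self.b2)]


# Player 0 is uniform over hands; player 1 is known to hold hand 0.
beliefs = [[1, 1, 1, 1, 1, 1], [2, 0, 0, 0, 0, 0]]
x = encode_belief_state([0.5, 1.0, 0.0], beliefs)
net = TinyValueNet(len(x), hidden_dim=8, out_dim=NUM_PLAYERS * NUM_HANDS)
values = net.forward(x)
```

The point of the sketch is only the input shape: the network sees probability distributions over hidden cards rather than a single concrete deal, which is what lets a search procedure query it at states where the true cards are unknown.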

Play episode from 46:31

