The Basics of a Reinforcement Learning Pipeline

The board state is essentially what's happening right now. And the history is what was the move before and before and before that. So it might very well be that players who do coordinate, even if they're technically enemies who do coordinate for a mover to are better off at the end than had they not coordinated. In any case, we get the board state as an input and that goes into different directions as you can see.

Play episode from 13:40

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app