CICERO: An AI agent that negotiates, persuades, and cooperates with people

Yannic Kilcher Videos (Audio Only)

The Basics of a Reinforcement Learning Pipeline

The board state is essentially what's happening right now, and the history is what the move before was, and the one before that, and so on. So it might very well be that players who do coordinate for a move or two, even if they're technically enemies, are better off at the end than had they not coordinated. In any case, we get the board state as an input, and that goes in different directions, as you can see.
