
CICERO: An AI agent that negotiates, persuades, and cooperates with people

Yannic Kilcher Videos (Audio Only)


Cicero and DiL-piKL

Cicero computes an anchor policy for both itself and the other player based on their shared conversation, the board state, and the recent action history. Cicero then ran DiL-piKL, which is their variant of piKL that not only includes two players, but, I think... is that the variant? I think I'm describing the right thing here. Okay. It ran DiL-piKL for the two players in order to predict player j's policy. On each iteration, Cicero assumed the five remaining players would play according to a policy computed via RL. So there's a lot of adjustment happening for the fact that they don't have all the information.
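To make the role of the anchor policy a bit more concrete, here is a minimal sketch of the KL-regularized objective that piKL-style methods build on: choose the policy that maximizes expected value minus `lam` times the KL divergence from the anchor. This is not Cicero's implementation and not the full iterative DiL-piKL procedure. The function name `kl_regularized_policy`, the three-action setup, and all numbers are hypothetical, chosen only for illustration.

```python
import math

def kl_regularized_policy(anchor, q_values, lam):
    """Policy maximizing E_pi[Q] - lam * KL(pi || anchor).

    Hypothetical helper for illustration. `anchor` must assign a strictly
    positive probability to every action, and `lam` must be positive.
    """
    # Closed form: pi(a) is proportional to anchor(a) * exp(Q(a) / lam).
    logits = [math.log(p) + q / lam for p, q in zip(anchor, q_values)]
    # Subtract the max logit before exponentiating for numerical stability.
    top = max(logits)
    weights = [math.exp(l - top) for l in logits]
    total = sum(weights)
    return [w / total for w in weights]

# Toy numbers (assumptions, not taken from the paper).
anchor = [0.7, 0.2, 0.1]    # human-like anchor policy over three actions
q_values = [0.0, 1.0, 0.5]  # estimated value of each action

# Large lam: stay close to the anchor. Small lam: follow the values.
close_to_anchor = kl_regularized_policy(anchor, q_values, lam=100.0)
close_to_greedy = kl_regularized_policy(anchor, q_values, lam=0.01)
```

The single knob `lam` trades off playing like the anchor (large `lam`) against playing the highest-value action (small `lam`), which is the basic tension the passage above is describing.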
