Yannic Kilcher Videos (Audio Only) cover image

CICERO: An AI agent that negotiates, persuades, and cooperates with people

Yannic Kilcher Videos (Audio Only)

00:00

Strategy Cloning

Cicero runs a strategic reasoning module that predicts other players policies, and also its own. It then chooses a policy for itself for the current turn that responds optimally to the other players predicted policy. What I would want to see is that the policy also includes language actions. Right now, this is just a consequence of the action I select. And the language model is just tasked with communicating this. But if this here was an action to, then my planning module could actually reason about what it would be best to communicate and to whom in order to achieve my goals. So again, they want to maximize their reward by simply being a cold, hard at bot.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app