Credibility and Commitments in Multi-Agent Systems

This chapter explores the strategic implications of agents' ability to make credible commitments in multi-agent systems, discussing commitment races, transparency, AI misalignment scenarios, and the offense-defense theory in relation to AI deployment.

Play episode from 18:27

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app