
Episode 27: Noam Brown, FAIR, on achieving human-level performance in poker and Diplomacy, and the power of spending compute at inference time
Generally Intelligent
Six Player Poker
Once you introduce cooperation into the game, then a lot of these equilibrium findings from scratch techniques don't end up being that useful in practice. The clearest example of this is English. Imagine a game like diplomacy where you have to communicate with the other players in a language. That gibberish robot language is fine if you're playing against other robots because you'll be able to communicate with them. But if you stick that agent in a game with six human players, they're not going to work with the bot because they're all speaking English. So once you go to a game that involves cooperation with humans, then you can't just do these techniques like self-play from scratch, with
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.