Episode 27: Noam Brown, FAIR, on achieving human-level performance in poker and Diplomacy, and the power of spending compute at inference time

Generally Intelligent

CHAPTER

What Are the Biggest Breakthroughs in Cicero?

Did you hard code anything about the game rules or things like that in order for it to be able to do training effectively?

We have the rules of the game in the transition function. So we're able to step through: given the action that everybody picks for the current turn, we can then see, okay, according to the rules of the game, this is the new state that we end up in. And then we feed that state into the value function instead of feeding the actions of all the players into the value function.

Right. That makes sense. Do you feel like in terms of progressing so quickly from no practical press? And I know you were working on it all at the same
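
The idea in the answer above (apply the game rules to everyone's chosen actions, then score the resulting state rather than the actions themselves) can be illustrated with a minimal sketch. Everything here is a toy assumption made for illustration: the state layout, the `transition` rules, the `value_fn` heuristic, and the name `evaluate_joint_action` are hypothetical and are not taken from Cicero or from the episode.

```python
from typing import Callable, Dict, Tuple

# Toy assumptions: a state is one integer score per player, and a joint
# action maps each player index to an integer move. Real Diplomacy states
# and orders are far richer; this only shows the shape of the computation.
State = Tuple[int, ...]
JointAction = Dict[int, int]


def transition(state: State, joint_action: JointAction) -> State:
    """Hypothetical game rules: each player's score grows by their own move."""
    return tuple(score + joint_action.get(player, 0)
                 for player, score in enumerate(state))


def value_fn(state: State) -> Tuple[float, ...]:
    """Hypothetical value function: each player's share of the total score."""
    total = sum(state)
    if total == 0:
        return tuple(1.0 / len(state) for _ in state)
    return tuple(score / total for score in state)


def evaluate_joint_action(
    state: State,
    joint_action: JointAction,
    transition_fn: Callable[[State, JointAction], State] = transition,
    value: Callable[[State], Tuple[float, ...]] = value_fn,
) -> Tuple[float, ...]:
    """Step the rules first, then score the resulting state.

    The value function never sees the joint action directly, only the
    state that the rules produce from it.
    """
    next_state = transition_fn(state, joint_action)
    return value(next_state)


if __name__ == "__main__":
    # Player 0 plays 2, player 1 plays 0, starting from equal scores.
    print(evaluate_joint_action((1, 1), {0: 2, 1: 0}))
```

One consequence of this arrangement is that the value function only needs to take a state as input, since the known rules handle the mapping from joint actions to next states.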
