4min chapter

Episode 27: Noam Brown, FAIR, on achieving human-level performance in poker and Diplomacy, and the power of spending compute at inference time

Generally Intelligent

CHAPTER

The Dialog Model Is Used to Predict What All the Players Are Going to Do

The dialogue model takes into account the board state, the history, and the history of dialogue in general that has happened across all the players. We feed that into a large language model and then use it to predict what all the players are going to do for the current turn. It also gives us the intents that we feed into the dialogue model: the action that we're going to play and the action that we would like the other player to play as well. This dialogue model is trained on the human data. We actually filter out deceptive messages from the training data for this dialogue generation part.
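The data flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual system: every name here (`Message`, `Intent`, `filter_training_messages`, `make_intent`, `build_dialogue_input`), the `deceptive` flag, and the text layout of the model input are hypothetical, and the predicted actions are passed in as a plain dictionary rather than produced by a real model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    sender: str
    recipient: str
    text: str
    # Hypothetical label: the source does not say how deceptive messages are identified.
    deceptive: bool = False


@dataclass(frozen=True)
class Intent:
    own_action: str      # the action we plan to play
    partner_action: str  # the action we would like the other player to play


def filter_training_messages(messages):
    """Drop messages labeled deceptive before they are used as training data."""
    return [m for m in messages if not m.deceptive]


def make_intent(predicted_actions, me, partner):
    """Build an intent from per-player action predictions (assumed given as a dict)."""
    return Intent(
        own_action=predicted_actions[me],
        partner_action=predicted_actions[partner],
    )


def build_dialogue_input(board_state, history, intent):
    """Assemble one possible text input for a dialogue model (format is an assumption)."""
    lines = [f"BOARD: {board_state}"]
    lines += [f"{m.sender} -> {m.recipient}: {m.text}" for m in history]
    lines.append(f"INTENT self: {intent.own_action}")
    lines.append(f"INTENT partner: {intent.partner_action}")
    return "\n".join(lines)
```

For example, calling `make_intent({"FRANCE": "A PAR - BUR", "ENGLAND": "F LON - ENG"}, "FRANCE", "ENGLAND")` yields an intent whose two fields are the planned move and the hoped-for move, which `build_dialogue_input` then appends after the board state and message history.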
