
Episode 27: Noam Brown, FAIR, on achieving human-level performance in poker and Diplomacy, and the power of spending compute at inference time
Generally Intelligent
Is Chain of Thought a Good Idea?
Chain of thought is very rudimentary relative to other planning. Montegoich research says, let me improve what I would do in the future and get a better estimate of what I should be doing right now. You have this really nice value function in these recreational games that you don't have with all natural language generation tasks. Co-generation is one example where you could have a value function, at least in theory.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.