Justified Posteriors cover image

Evaluating GDPVal, OpenAI's Eval for Economic Value

Justified Posteriors

00:00

What were the headline win-rate results across models?

The hosts report Claude Opus near parity at 47.6% and GPT-5 High at 38.8%, surprising their priors.

Play episode from 23:50
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app