Sharp Tech with Ben Thompson cover image

(Preview) Google Starts Dancing, The Winners and Losers of Gemini Week, OpenAI Has an Advertising Problem

Sharp Tech with Ben Thompson

00:00

Benchmarks: When They Matter

Ben explains why verifiable benchmarks, like coding tasks, are the most informative measures of model quality.

Play episode from 07:53
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app