"Age of Miracles" cover image

Anton Teaches Packy AI | Ep 2 | Chinchilla

"Age of Miracles"

00:00

The Saturation of Benchmarks for Generative Text Models

There's another problem kind of hiding here in plain sight, which is that we're starting to saturate benchmarks. But qualitatively, the performance is definitely still improving. How do we measure the performance of these models is like a big, open question now.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app