Eye On A.I. cover image

#298 Ryan Kolln: How Appen Trains the World's Most Powerful AI Models

Eye On A.I.

00:00

Why benchmarks alone don't measure real-world model quality

Craig asks about evaluation evolution; Ryan explains limitations of narrow benchmarks and the need for broader user-centered measures.

Play episode from 03:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app