The 100-person AI lab that became Anthropic and Google's secret weapon | Edwin Chen (Surge AI)

Lenny's Podcast: Product | Career | Growth

Measuring progress with human evaluations

Edwin explains how Surge runs in-depth human evaluations, in which expert annotators assess models on realistic, domain-specific tasks.

Play episode from 20:15
