How to Do AI Evals Step-by-Step with Real Production Data | Tutorial by Hamel Husain and Shreya Shankar

The Growth Podcast

Validating judges: TPR and TNR over agreement

Hamel Husain and Shreya Shankar warn against using raw agreement as the metric for validating a judge, and explain how to measure the judge's true positive rate (TPR) and true negative rate (TNR) instead.
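
To illustrate why agreement alone can mislead, here is a minimal sketch that is not taken from the episode. It assumes the judge's verdicts are compared against human labels, that "positive" means the human marked the output as a pass, and that the function name `judge_rates` and the 90/10 data are made up for the example.

```python
def judge_rates(human, judge):
    """Compare judge verdicts to human labels (True = pass, an assumed convention)."""
    if not human or len(human) != len(judge):
        raise ValueError("need two non-empty label lists of equal length")
    pairs = list(zip(human, judge))
    tp = sum(1 for h, j in pairs if h and j)          # both say pass
    tn = sum(1 for h, j in pairs if not h and not j)  # both say fail
    pos = sum(1 for h in human if h)                  # human passes
    neg = len(human) - pos                            # human fails
    return {
        "agreement": (tp + tn) / len(human),
        "tpr": tp / pos if pos else float("nan"),
        "tnr": tn / neg if neg else float("nan"),
    }

# Hypothetical data: 90 human passes, 10 human fails,
# and a judge that answers "pass" on every example.
human = [True] * 90 + [False] * 10
judge = [True] * 100
rates = judge_rates(human, judge)
print(rates)
```

In this made-up case the judge agrees with the human labels 90% of the time, yet its TNR is 0 because it never catches a failure. Reporting TPR and TNR separately exposes that gap, which a single agreement number hides when the labels are imbalanced.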

Play episode from 46:21