How to Do AI Evals Step-by-Step with Real Production Data | Tutorial by Hamel Husain and Shreya Shankar

The Growth Podcast

Code-based vs LLM-based evals

Hamel Husain and Shreya Shankar recommend simple code-based checks for formatting and LLM judges for subjective decisions such as handoffs.
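As a rough illustration of that split, here is a minimal Python sketch. It is not taken from the episode: the JSON output format, the required keys, the judge prompt, and the `call_llm` callable are all hypothetical stand-ins, and "handoff" is assumed here to mean passing a conversation to a human agent.

```python
from __future__ import annotations

import json
from typing import Callable


def check_json_format(output: str, required_keys: set[str]) -> bool:
    """Code-based eval: pass only if the output is a JSON object
    containing every required key. Deterministic, no model call."""
    try:
        data = json.loads(output)
    except json.JSONDecodeError:
        return False
    return isinstance(data, dict) and required_keys.issubset(data)


# Hypothetical judge prompt; a real one would be tuned against labeled examples.
JUDGE_PROMPT = (
    "You are reviewing a support conversation.\n"
    "Should the assistant have handed off to a human? "
    "Answer PASS if the handoff decision was appropriate, otherwise FAIL.\n\n"
    "Conversation:\n{transcript}"
)


def judge_handoff(transcript: str, call_llm: Callable[[str], str]) -> bool:
    """LLM-based eval: ask a model for a binary verdict on a subjective call.
    `call_llm` is an assumed wrapper around whatever model client is in use."""
    reply = call_llm(JUDGE_PROMPT.format(transcript=transcript))
    return reply.strip().upper().startswith("PASS")
```

The format check needs no model and gives the same answer every run, so it suits anything that can be stated as a rule. The judge is reserved for the decision that cannot, and taking `call_llm` as a parameter keeps the sketch independent of any particular provider.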

Play episode from 37:56