The Stack Overflow Podcast cover image

Every ecommerce hero needs a Sidekick

The Stack Overflow Podcast

00:00

Preventing hallucinations through evals

Vanessa details building evaluation datasets, LLM judges, and using synthetic and human-in-the-loop grading to avoid errors.

Play episode from 06:55
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app