Product Thinking cover image

Episode 255: How Shopify Is Leveraging AI at Scale with Vanessa Lee

Product Thinking

00:00

Building an LLM Judge for Continuous Evaluation

Vanessa explains grading dimensions, human overlap, and tuning an LLM judge to replicate human evaluations for Sidekick.

Play episode from 15:46
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app