Episode 62: Practical AI at Work: How Execs and Developers Can Actually Use LLMs

Vanishing Gradients

AI Evaluation as Software Testing

Randall argues for automated AI evals and gold test sets that measure performance at scale, much as unit and integration tests do for software.

Play episode from 27:58