Behind the Craft cover image

Notion's New AI Agents for Work Blew Me Away (Full Tutorial) | Akshay & Ryan

Behind the Craft

00:00

Quality control: evals, unit tests, and human feedback

Ryan describes their large eval suite, golden and hard test sets, and blending LLM judges with manual scoring for reliability.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app