Latent Space: The AI Engineer Podcast cover image

Brex’s AI Hail Mary — With CTO James Reggio

Latent Space: The AI Engineer Podcast

00:00

Tracking Feature Sophistication with Failing Tests

Reggio discusses writing long-lived failing evals to measure progression as assistants gain new capabilities.

Play episode from 56:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app