
Why Traditional Observability Falls Short for AI Agents
The Data Exchange with Ben Lorica
Evals, testing, and synthetic datasets
They cover the many flavors of evals, production evaluations, and Monte Carlo's current focus on production telemetry rather than synthetic data.


