Your AI agent isn’t failing because it’s dumb—it’s failing because you refuse to test it. Chiara Caratelli cuts through the hype to show why evaluations—not bigger models or fancier prompts—decide whether agents succeed in the real world. If you’re not stress-testing, simulating, and iterating on failures, you’re not building AI—you’re shipping experiments disguised as products.
Guest speaker: Chiara Caratelli - Data Scientist @ Prosus Group
Host: Demetrios Brinkmann - Founder of MLOps Community