
Are Evals Dead?
MLOps.community
00:00
Prioritizing low-error domains versus tolerant interactions
Demetrios and Chiara discuss setting stricter standards for critical tasks (safety, allergies) and tolerating conversational hiccups elsewhere.
Transcript
Play full episode