
Are Evals Dead?
MLOps.community
What traditional ML practices transfer to agents
Chiara compares agent evaluation to traditional ML evaluation, emphasizing black-box testing of inputs and outputs and the added complexity that tool calling introduces.