
Are Evals Dead?
MLOps.community
00:00
Using model judges carefully and surfacing real errors
Chiara warns that LLM judges can overlook faults and encourages instructing them to find and report real errors.
Transcript
Play full episode
Chiara warns that LLM judges can overlook faults and encourages instructing them to find and report real errors.