
Are Evals Dead?
MLOps.community
Bootstrapping evaluation sets and curated test cases
Chiara describes creating curated examples and treating prompt design like a data science task to bootstrap evals.
Transcript
