
#526: Building Data Science with Foundation LLM Models
Talk Python To Me
Measuring Model Changes with Test Sets
Michael asks about model swaps and Hugo recommends building gold-standard test sets to evaluate new models with data, not vibes.
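As a rough illustration of the idea in this clip, here is a minimal Python sketch of scoring a current model and a candidate replacement against the same gold-standard test set. Everything in it is an assumption made for illustration and does not come from the episode: the example data, the function names, the exact-match scoring, and the two stand-in "models", which are plain functions in place of real LLM calls.

```python
from typing import Callable

# Hypothetical gold-standard test set: inputs paired with human-verified labels.
GOLD_SET = [
    {"input": "The package arrived broken.", "expected": "negative"},
    {"input": "Works great, thanks!", "expected": "positive"},
    {"input": "Refund took three weeks.", "expected": "negative"},
    {"input": "Setup was quick and painless.", "expected": "positive"},
]


def accuracy(model: Callable[[str], str], gold_set: list[dict]) -> float:
    """Fraction of gold examples where the model output matches the label."""
    correct = sum(
        model(ex["input"]).strip().lower() == ex["expected"] for ex in gold_set
    )
    return correct / len(gold_set)


def failures(model: Callable[[str], str], gold_set: list[dict]) -> list[str]:
    """Inputs the model gets wrong, so regressions can be inspected by hand."""
    return [
        ex["input"]
        for ex in gold_set
        if model(ex["input"]).strip().lower() != ex["expected"]
    ]


# Stand-ins for real model calls (assumptions for illustration only).
def current_model(text: str) -> str:
    return "negative" if "broken" in text or "Refund" in text else "positive"


def candidate_model(text: str) -> str:
    return "negative" if "broken" in text else "positive"


if __name__ == "__main__":
    for name, model in [("current", current_model), ("candidate", candidate_model)]:
        print(name, accuracy(model, GOLD_SET), failures(model, GOLD_SET))
```

Running both models over the same fixed examples turns a model swap into a number that can be compared and a list of failing cases that can be reviewed, instead of an impression. Exact-match scoring only suits tasks with a single correct label; free-form outputs would need a different comparison.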
Play episode from 01:01:19
Transcript


