
Models, Evals, and Raptor Mini with Julia Kasper
VS Code Insiders Podcast
00:00
Model Evaluations and VS Code Benchmarks
Julia outlines online vs offline evals, metrics like time‑to‑first‑token, VSC bench, and challenges in AI evaluation.
Play episode from 16:20
Transcript


