Spring Office Hours cover image

S5E01 - 2026 Predictions and New Year Kickoff 🎉

Spring Office Hours

00:00

LLM benchmarking and evaluation tools

Dan calls for better LLM benchmarks; they mention community projects and the need for model-evaluation tooling in 2026.

Play episode from 32:09
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app