Unsupervised Learning with Jacob Effron cover image

Ep 77: Anthropic’s Dianne Na Penn on Opus 4.5, Rethinking Model Scaffolding & Safety as a Competitive Advantage

Unsupervised Learning with Jacob Effron

00:00

Evolving Eval Design Beyond Benchmarks

Dianne argues Sweetbench-style tasks are saturated and calls for open-ended, quantifiable evals like Vending Bench variants.

Play episode from 23:17
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app