Into AI Safety cover image

Sobering Up on AI Progress w/ Dr. Sean McGregor

Into AI Safety

00:00

How BenchRisk evaluates benchmark trustworthiness

Sean details BenchRisk's focus on documentation, longevity, and whether benchmarks actually reflect real-world queries.

Play episode from 40:03
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app