Get the app
Andrew Gordon
Staff researcher in behavioral science at Prolific focusing on human-centered evaluation of AI and online research methodologies, and co-developer of Prolific's Humane leaderboard.
Best podcasts with Andrew Gordon
Ranked by the Snipd community
57 snips
Dec 20, 2025
• 16min
Are AI Benchmarks Telling The Full Story? [SPONSORED] (Andrew Gordon and Nora Petrova - Prolific)
chevron_right
Join Andrew Gordon, a behavioral science researcher at Prolific, and AI expert Nora Petrova as they delve into the flaws of current AI benchmarking. They challenge the notion that high scores mean better models, using a Formula 1 car as an analogy. The discussion touches on critical issues like AI safety, especially in sensitive contexts like mental health, and critiques the biases in popular ranking systems. Discover how Prolific's innovative HUMAINE framework and TrueSkill methodology aim to create a more human-centered evaluation of AI.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app