
Episode 7: There Are Now 15 Competing Evaluation Metrics (ft. Dr. Jeremy Kahn). December 12, 2022
Mystery AI Hype Theater 3000
The GISH Galap: A Rhetorical Style That Says Nothing
The paper itself is 150 pages long, absurd and huge. Some contributions here that are actually quite useful for the community. And those contributions also have some limitations that they haven't quite spelled out. But it's done in bad faith, it's genuinely a GISH Galap. It was two and a half years of GPU time to run all these evaluations.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.