
2025 State of AI Report and Predictions
Don't Worry About the Vase Podcast
00:00
Benchmarks Losing Usefulness
Zvi critiques the growing limits of benchmarks and highlights differences among evaluation suites and lab behaviors.
Transcript
Play full episode