AI Breakdown

Arxiv paper - Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Jun 9, 2025
Ask episode
Chapters
Transcript
Episode notes