

arxiv preprint - CinePile: A Long Video Question Answering Dataset and Benchmark
May 30, 2024
Guest
Ruchit Rawal
Guest
Khalid Saifullah
Guest
Ronen Basri
Guest
Tom Goldstein
Guest
Gowthami Somepalli
Guest
David Jacobs
Researcher Ruchit Rawal and his team discuss CinePile, a new dataset and benchmark challenging video comprehension, showcasing a significant gap between machine and human performance in complex tasks. The dataset consists of 305,000 multiple-choice questions covering various visual and multimodal aspects, surpassing current limitations.
Chapters
Transcript
Episode notes