arxiv preprint - CinePile: A Long Video Question Answering Dataset and Benchmark

May 30, 2024

Guest

Ruchit Rawal

Guest

Researcher Ruchit Rawal and his team discuss CinePile, a new dataset and benchmark challenging video comprehension, showcasing a significant gap between machine and human performance in complex tasks. The dataset consists of 305,000 multiple-choice questions covering various visual and multimodal aspects, surpassing current limitations.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 2min

Exploring the CinePile Dataset and Model Benchmarking

01:52 • 2min

Exploring Cinepile: A Benchmark for Video Comprehension and Question Answering

03:30 • 2min