

ColPali: Document Retrieval with Vision-Language Models only (with Manuel Faysse)
7 snips Sep 27, 2024
Manuel Faysse, a PhD student from CentraleSupélec & Illuin Technology and first author of a pivotal paper on document retrieval, discusses his innovative model, ColPali. He shares the "Aha!" moment that inspired its creation and outlines the challenges faced in research. ColPali simplifies traditional retrieval systems using vision-language models, enhancing efficiency and relevance in document search. Manuel also compares ColPali with classic multimodal models, showcasing its superiority and potential for future applications.
Chapters
Transcript
Episode notes
1 2 3 4 5 6
Intro
00:00 • 4min
The Journey from Idea to Implementation in Document Retrieval
04:09 • 3min
Enhancing Information Retrieval with Multi-Vector Approaches
07:11 • 2min
Streamlining Document Retrieval with Golpali
09:19 • 2min
Innovating Image Queries with Vision-Language Models
11:24 • 20min
The Future of Document Retrieval with Vision-Language Models
31:26 • 3min