
Multi-modal Deep Learning for Complex Document Understanding with Doug Burdick - #541
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Navigating PDF Data Extraction Challenges
This chapter explores the integration of natural language processing with document conversion, focusing on the difficulties of extracting data from PDFs, particularly for COVID-19 research. It discusses collaborative efforts to develop solutions for better machine consumption of documents while addressing the inherent limitations of the PDF format.
Transcript
Play full episode