4min chapter

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Multi-modal Deep Learning for Complex Document Understanding with Doug Burdick - #541

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

Using Deep Learning to Identify Tables and Cells

The two big steps are what we call document conversion, and then we'll say the document understanding. Within that first step of conversion, which is a pipe line by itself, there will be a step where we actually just think of it as basically pulling the basic data out of the p d f document. There's another process after this that is actually taking theot put of the object detection. So i have the boxes for here's the table, and here's the cells on the page. How do i actually turn that into a table structure? That is the output of the table extraction step. Then that table extraction step goes back into a a nal document. And then that text format

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode