3min chapter

Practical AI: Machine Learning, Data Science, LLM cover image

What's up, DocQuery?

Practical AI: Machine Learning, Data Science, LLM

CHAPTER

Ocr

We do a bunch of pre processing upfront that basically normalizes anything you upload into a fairly consistent data structure. What we do is we take almost any file you could throw at the system, anything from PDF files to emails,. HTML files, scanned images, pictures from your phone, just about anything. And wenormalize it into a bunch of pixels, a bunch of text, and a bunch of bounding boxes that tell you where the pieces of text are as well as a few other things.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode