The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672

Feb 19, 2024
Armineh Nourbakhsh, Executive Director at JP Morgan AI Research, dives into the exciting world of DocLLM, a layout-aware large language model designed for document understanding. She shares insights on the evolution of document AI, focusing on multimodal approaches that combine textual and visual data. Nourbakhsh discusses the challenges of training generative models, the intricacies of processing enterprise documents, and strategies to reduce hallucinations in language models, enhancing performance in complex document analysis.
45:38

Podcast summary created with Snipd AI

Quick takeaways

  • DocLLM integrates textual semantics and spatial layout for processing complex documents.
  • Armineh emphasizes the importance of instruction tuning and future directions for DocLLM's development.

Deep dives

Armina's Background and Introduction to NLP

Armina shares her background in Unimodal NLP and how she got into AI research. She accidentally ended up working on sentiment analysis and later transitioned to document AI.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner