How AI Is Built

Chunking for RAG: Stop Breaking Your Documents Into Meaningless Pieces | S2 E20

Jan 3, 2025
Brandon Smith, a research engineer at Chroma known for his extensive work on chunking techniques for retrieval-augmented generation systems, shares his insights on optimizing semantic search. He discusses the common misconceptions surrounding chunk sizes and overlap, highlighting the challenges of maintaining context in dense content. Smith criticizes existing strategies, such as OpenAI's 800-token chunks, and emphasizes the importance of coherent parsing. He also introduces innovative approaches to enhance contextual integrity in document processing, paving the way for improved information retrieval.
49:13

Podcast summary created with Snipd AI

Quick takeaways

  • Achieving accuracy in semantic search is challenging due to complexities in chunking techniques that can cause significant information loss.
  • Traditional chunking methods often prioritize efficiency over contextual clarity, risking the loss of critical details in information retrieval processes.

Deep dives

The Complexity of Semantic Search

Semantic search looks simple, but making it accurate is hard. A basic pipeline is straightforward to set up and run; tuning it to return reliable results is much harder. Chunking is a particular source of trouble: done poorly, it loses significant information. For example, representing dense content with a single long chunk can drop critical details, much like trying to compress an entire Wikipedia page into a tweet.
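
To make the fixed-size strategy concrete, here is a minimal Python sketch of chunking with overlap. It is an illustration only, not the method discussed in the episode: the function name `chunk_words` is hypothetical, whitespace-separated words stand in for model tokens, the default size of 800 echoes the figure mentioned in the episode description, and the overlap of 100 is an arbitrary assumption.

```python
def chunk_words(text: str, chunk_size: int = 800, overlap: int = 100) -> list[str]:
    """Split text into fixed-size word windows that share `overlap` words.

    Assumption: whitespace-separated words approximate tokens. A real
    pipeline would count tokens with the embedding model's tokenizer.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the window has reached the end of the text, so the
        # last chunk is not followed by a redundant, fully overlapped tail.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_words(sample, chunk_size=4, overlap=1):
        print(chunk)
```

Note that the window boundaries ignore sentence and section structure entirely, which is the kind of context loss the takeaways above warn about: a chunk can begin or end mid-thought, and each chunk is then embedded without the surrounding text.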
