The Hard Truth About RAG in Production (ft Kirk Marple)

11 snips

Feb 4, 2025

Kirk Marple, Founder and CEO of Graphlit, shares his expertise on building production-ready RAG systems. He dives into the significance of knowledge graphs and effective reranking strategies. Kirk emphasizes the importance of scaling RAG beyond basic implementations and reveals the critical differences between demo projects and production systems in AI. The conversation highlights challenges and best practices for developers, offering valuable insights on enhancing language models through sophisticated data retrieval techniques.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

RAG Explained

Retrieval Augmented Generation (RAG) enhances LLMs by incorporating external data into prompts.
This allows LLMs to access information beyond their training data, enabling more accurate and context-aware responses.

ADVICE

Chunking Strategies

Don't overthink chunk size; defaults with overlap work well.
Focus on retrieval expansion and re-ranking for relevance.

INSIGHT

Metadata and Embeddings

Metadata in vector embeddings can create false positives.
A hybrid approach using separate JSON metadata stores is more effective.

Get the Snipd Podcast app to discover more snips from this episode

Get the app