Weaviate Podcast

Weaviate

Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.

Episodes

Dec 8, 2025 • 1h 1min

Pyversity with Thomas van Dongen - Weaviate Podcast #132!

Thomas van Dongen is the head of AI engineering at Springer Nature and the creator of Pyversity! Pyversity is a fast, lightweight open-source Python library for diversifying retrieval results. Retrieval systems often return highly similar items. Pyversity efficiently re-ranks these results to encourage diversity, surfacing items that remain relevant but less redundant. It implements several popular diversification strategies such as MMR, MSD, DPP, and Cover with a clear, unified API.
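As a rough illustration of the re-ranking idea described above, here is a minimal sketch of greedy MMR (Maximal Marginal Relevance) in plain Python. It is not Pyversity's API or implementation; the function names `cosine` and `mmr_rerank`, the `lambda_` trade-off parameter, and the toy data are all assumptions made for this example.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mmr_rerank(embeddings, relevance, k, lambda_=0.5):
    """Greedily pick k item indices, trading relevance against redundancy.

    Hypothetical helper: lambda_=1.0 ranks purely by relevance, while lower
    values penalize items similar to those already selected.
    """
    candidates = list(range(len(embeddings)))
    selected = []
    while candidates and len(selected) < k:

        def score(i):
            # Redundancy is the highest similarity to any already-selected item.
            redundancy = max(
                (cosine(embeddings[i], embeddings[j]) for j in selected),
                default=0.0,
            )
            return lambda_ * relevance[i] - (1 - lambda_) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data: items 0 and 1 are near-duplicates, item 2 is different but less relevant.
embeddings = [[1.0, 0.0], [0.99, 0.1], [0.0, 1.0]]
relevance = [0.9, 0.85, 0.7]

print(mmr_rerank(embeddings, relevance, k=2))               # diversity-aware pick
print(mmr_rerank(embeddings, relevance, k=2, lambda_=1.0))  # relevance-only pick
```

With the toy data, the diversity-aware call skips the near-duplicate in favor of the dissimilar item, while `lambda_=1.0` falls back to plain relevance order.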

Nov 18, 2025 • 1h 2min

Semantic Query Engines with Matthew Russo - Weaviate Podcast #131!

Matthew Russo, a Ph.D. student at MIT, dives into the world of semantic query processing engines and their potential to revolutionize database systems. He discusses the emergence of semantic operators like AI_WHERE and their role in transforming how we handle unstructured data. With insights on optimizing query planning and the benefits of filtering order, Matthew also introduces SemBench, a crucial standardized benchmark for evaluating semantic queries. Expect a lively exploration of the future of AI in databases and practical optimization strategies!

Nov 3, 2025 • 60min

REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!

Xiaoqiang Lin, a Ph.D. student at the National University of Singapore and former Meta researcher, dives into the innovative REFRAG method for enhancing retrieval-augmented generation. He explains how REFRAG improves LLM inference speeds, making Time-To-First-Token 31x faster. The discussion also covers multi-granular chunk embeddings, performance trade-offs in compression, and the exciting future of agentic AI. Listeners will learn about the balance between data and architecture for long-context capabilities and the practical compute requirements for training.

Oct 13, 2025 • 44min

Weaviate and SAS with Saurabh Mishra and Bob van Luijt - Weaviate Podcast #129!

In this conversation, Saurabh Mishra, a senior product and engineering leader at SAS, discusses the partnership between SAS and Weaviate on the SAS Retrieval Agent Manager (SAS RAM). He explores how retrieval-augmented generation is transforming enterprise AI, particularly for managing unstructured data. Saurabh highlights real-world use cases, including predictive maintenance in manufacturing, and addresses persistent challenges in data security and AI trustworthiness. He also shares insights on the evolving developer experience and the future of SAS RAM.

Sep 22, 2025 • 1h 2min

Weaviate's Query Agent with Charles Pierse - Weaviate Podcast #128!

Charles Pierse, Director of Weaviate Labs, shares insights on the GA release of the Weaviate Query Agent. He discusses the journey from beta to GA, highlighting unexpected lessons and team collaborations. The conversation dives into technical aspects, including response models, citations, and how Search Mode enhances retrieval. Charles explains how the Query Agent integrates with the Cloud Console, making it intuitive for users. He also presents a compelling case study featuring MetaBuddy's innovative use of the agent for nutrition data.

Aug 13, 2025 • 1h 2min

GEPA with Lakshya A. Agrawal - Weaviate Podcast #127!

Lakshya A. Agrawal, a Ph.D. student at U.C. Berkeley, discusses his work on GEPA, a prompt optimizer driven by Large Language Models (LLMs). He elaborates on three key innovations: Pareto-Optimal Candidate Selection, Reflective Prompt Mutation, and System-Aware Merging. Lakshya explores how these techniques enhance AI efficiency, the importance of incorporating domain knowledge, and the role of benchmarks like LangProBe. He also delves into the future of AI in scientific simulations and the advantages of merging language-based learning with traditional methods.

Jul 9, 2025 • 1h 5min

Agentic Topic Modeling with Maarten Grootendorst - Weaviate Podcast #126!

Maarten Grootendorst, a psychologist turned AI engineer known for creating BERTopic, dives into the exciting world of agentic topic modeling. He discusses how large language models (LLMs) are revolutionizing the way we extract and categorize topics from complex data. The conversation highlights the challenges of traditional vs. LLM-driven methods and the critical role of human feedback. Maarten also emphasizes the importance of modularity in BERTopic, allowing for adaptive and efficient topic exploration tailored to user needs.

Jul 2, 2025 • 51min

Sufficient Context with Hailey Joren - Weaviate Podcast #125!

In this installment, Hailey Joren, a Ph.D. student at UCSD, shares her insights on retrieval-augmented generation systems. She sheds light on the crucial difference between relevant search results and 'sufficient context' for accurate answers. She introduces her team's autorater and discusses how current models struggle with hallucinations. Expect discussions on fine-tuning methodologies, the role of context in AI responses, and the prospects for enhancing model reliability and interpretability.

Jun 25, 2025 • 1h 5min

RAG Benchmarks with Nandan Thakur - Weaviate Podcast #124!

Nandan Thakur, a Ph.D. student at the University of Waterloo, dives deep into Retrieval-Augmented Generation (RAG) and significant retrieval benchmarks like BEIR and MIRACL. He discusses the evolution of embedding models and the balance between specialization and generalization. The conversation highlights advancements in query decomposition, emphasizing new methods for complex user queries. Nandan also explores the complexities of summarizing AI search results and the importance of nuanced evaluations in RAG benchmarks for real-world applications.

May 28, 2025 • 1h 13min

MUVERA with Rajesh Jayaram and Roberto Esposito - Weaviate Podcast #123!

Rajesh Jayaram, a senior research scientist at Google and first author of the MUVERA algorithm, joins Roberto Esposito from Weaviate to discuss innovative multi-vector retrieval. They explore how MUVERA's compression techniques significantly reduce storage needs while maintaining accuracy. Topics include the advantages of contextualized token embeddings, Locality-Sensitive Hashing in topic modeling, and the challenges of benchmarking advanced retrieval systems. Their fascinating insights offer a glimpse into the future of AI and efficient data representation.
