Weaviate Podcast cover image

Weaviate Podcast

Latest episodes

undefined
Jul 2, 2025 • 51min

Sufficient Context with Hailey Joren - Weaviate Podcast #125!

In this installment, Hailey Joren, a Ph.D. student at UCSD, shares her groundbreaking insights on retrieval augmented generation systems. She sheds light on the crucial difference between relevant search results and 'sufficient context' for accurate answers. With her team's innovative autorater, they tackle the future of AI, addressing how current models struggle with hallucinations. Expect discussions on fine-tuning methodologies, the role of context in AI responses, and the exciting prospects of enhancing model reliability and interpretability.
undefined
8 snips
Jun 25, 2025 • 1h 5min

RAG Benchmarks with Nandan Thakur - Weaviate Podcast #124!

Nandan Thakur, a Ph.D. student at the University of Waterloo, dives deep into Retrieval-Augmented Generation (RAG) and its significant benchmarks like BEIR and MIRACLE. He discusses the evolution of embedding models and the balance between specialization and generalization. The conversation highlights advancements in query decomposition, emphasizing new methods for complex user queries. Nandan also explores the complexities of summarizing AI search results and the importance of nuanced evaluations in RAG benchmarks for real-world applications.
undefined
4 snips
May 28, 2025 • 1h 13min

MUVERA with Rajesh Jayaram and Roberto Esposito - Weaviate Podcast #123!

Rajesh Jayaram, a senior research scientist at Google and first author of the MUVERA algorithm, joins Roberto Esposito from Weaviate to discuss innovative multi-vector retrieval. They explore how MUVERA's compression techniques significantly reduce storage needs while maintaining accuracy. Topics include the advantages of contextualized token embeddings, Locality-Sensitive Hashing in topic modeling, and the challenges of benchmarking advanced retrieval systems. Their fascinating insights offer a glimpse into the future of AI and efficient data representation.
undefined
7 snips
May 15, 2025 • 1h 1min

Patronus AI with Anand Kannappan - Weaviate Podcast #122!

Anand Kannappan, co-founder of Patronus AI, dives into the challenges of debugging complex AI agents. He introduces Percival, a game-changing tool that analyzes agent traces and identifies failures. Anand explains critical issues like 'context explosion' and the orchestration of multi-agent systems. The conversation shifts to the evolving landscape of AI evaluation, advocating for dynamic oversight over static methods. He envisions a future where AI systems monitor each other, providing insights on how to enhance agent performance and evaluation.
undefined
May 12, 2025 • 54min

Haize Labs with Leonard Tang - Weaviate Podcast #121!

Leonard Tang, co-founder of Haize Labs, delves into innovative techniques for AI evaluation. He shares how stacking weaker models can enhance the performance of stronger ones through the revolutionary Verdict library, boasting a 10-20% improvement over traditional models. The conversation includes practical insights on creating contrastive evaluation sets and implementing debate-based judging systems. Tang discusses the balance between AI safety and user feedback, offering transformative strategies to ensure that AI systems meet enterprise needs effectively.
undefined
5 snips
May 7, 2025 • 56min

Box AI with Ben Kus and Bob van Luijt

Ben Kus, CTO of Box, delves into the complexities of the company's three-layer infrastructure: managing millions of interactions, navigating multi-tenant security challenges, and ensuring AI adheres to intricate content permissions. He discusses the impact of vector embeddings on file sizes and emphasizes the continued relevance of RAG despite advancements in context windows. The conversation also highlights Box's development of AI agents aimed at streamlining cumbersome enterprise processes, creating a path to improved productivity in the workplace.
undefined
18 snips
Apr 9, 2025 • 1h 10min

Structured Outputs with Will Kurt and Cameron Pfiffer - Weaviate Podcast #119!

Join Will Kurt and Cameron Pfiffer, co-founders of .txt.ai, as they unveil the groundbreaking open-source library, Outlines. They discuss how constrained decoding enhances reliability in language model outputs, enabling capabilities like perfect JSON generation and guided reasoning. The duo shares insights on multitask inference, which boosts efficiency in AI systems, and the role of finite state machines in their innovations. Delve into practical applications, including knowledge graph creation and automated report generation, shaping the future of AI.
undefined
13 snips
Mar 25, 2025 • 1h 2min

Synthetic Data with David Berenstein and Ben Burtenshaw - Weaviate Podcast #118!

David Berenstein and Ben Burtenshaw from Hugging Face dive into the fascinating world of synthetic data generation. They discuss innovative methodologies like persona-driven data and integration tactics for enhancing quality and diversity. The duo highlights the importance of tools like DistilLabel and Argilla for smooth data augmentation and model fine-tuning. Excitingly, they explore the potential for synthetic image data and its impact on AI education, emphasizing accessibility and user-friendly solutions in AI's future.
undefined
30 snips
Mar 3, 2025 • 58min

Letta AI with Sarah Wooders - Weaviate Podcast #117!

In this captivating conversation, Sarah Wooders, co-founder and CTO of Letta AI, shares her revolutionary insights from the Berkeley Sky Computing Lab. She discusses the development of stateful AI agents that remember interactions, emphasizing the importance of memory management. Topics include context optimization, the evolution of AI personas, and innovative tools for enhancing developer experiences. Sarah also explores the integration of AI in coding workflows, shedding light on the future of conversational AI and its profound implications for tech.
undefined
17 snips
Feb 27, 2025 • 52min

Agent Experience with Matt Biilmann, Sebastian Witalec, and Charles Pierse - Weaviate Podcast #116!

Matt Biilmann, Co-founder and CEO of Netlify, brings his expertise in web platforms, joined by Sebastian Witalec from Weaviate. They dive into the fascinating concept of 'Agent Experience' and how it reshapes web development. The trio discusses the evolution of APIs with AI integration and the importance of tailored communication methods for agents. They also explore the challenges in designing user experiences for both developers and AI agents, emphasizing the need for open standards to enhance interactions and streamline workflows.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app