

Weaviate Podcast
Weaviate
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.
Episodes

May 7, 2025 • 56min
Box AI with Ben Kus and Bob van Luijt
Ben Kus, CTO of Box, delves into the complexities of the company's three-layer infrastructure: managing millions of interactions, navigating multi-tenant security challenges, and ensuring AI adheres to intricate content permissions. He discusses the impact of vector embeddings on file sizes and emphasizes the continued relevance of RAG despite advancements in context windows. The conversation also highlights Box's development of AI agents aimed at streamlining cumbersome enterprise processes, creating a path to improved productivity in the workplace.

Apr 9, 2025 • 1h 10min
Structured Outputs with Will Kurt and Cameron Pfiffer - Weaviate Podcast #119!
Join Will Kurt and Cameron Pfiffer from .txt as they discuss the open-source library Outlines. They explain how constrained decoding makes language model outputs more reliable, enabling capabilities like guaranteed-valid JSON generation and guided reasoning. The duo shares insights on multitask inference, which boosts efficiency in AI systems, and the role of finite state machines in their work. Delve into practical applications, including knowledge graph creation and automated report generation.

Mar 25, 2025 • 1h 2min
Synthetic Data with David Berenstein and Ben Burtenshaw - Weaviate Podcast #118!
David Berenstein and Ben Burtenshaw from Hugging Face dive into the fascinating world of synthetic data generation. They discuss innovative methodologies like persona-driven data and integration tactics for enhancing quality and diversity. The duo highlights the importance of tools like distilabel and Argilla for smooth data augmentation and model fine-tuning. They also explore the potential for synthetic image data and its impact on AI education, emphasizing accessibility and user-friendly solutions in AI's future.

Mar 3, 2025 • 58min
Letta AI with Sarah Wooders - Weaviate Podcast #117!
In this captivating conversation, Sarah Wooders, co-founder and CTO of Letta AI, shares her revolutionary insights from the Berkeley Sky Computing Lab. She discusses the development of stateful AI agents that remember interactions, emphasizing the importance of memory management. Topics include context optimization, the evolution of AI personas, and innovative tools for enhancing developer experiences. Sarah also explores the integration of AI in coding workflows, shedding light on the future of conversational AI and its profound implications for tech.

Feb 27, 2025 • 52min
Agent Experience with Matt Biilmann, Sebastian Witalec, and Charles Pierse - Weaviate Podcast #116!
Matt Biilmann, Co-founder and CEO of Netlify, brings his expertise in web platforms, joined by Sebastian Witalec and Charles Pierse from Weaviate. They dive into the fascinating concept of 'Agent Experience' and how it reshapes web development. The trio discusses the evolution of APIs with AI integration and the importance of tailored communication methods for agents. They also explore the challenges in designing user experiences for both developers and AI agents, emphasizing the need for open standards to enhance interactions and streamline workflows.

Feb 19, 2025 • 1h
Optimizing Retrieval Agents with Shirley Wu - Weaviate Podcast #115!
Shirley Wu, a PhD student at Stanford University, delves into AI agents and retrieval systems, bringing expertise from her work on the Avatar Optimizer and STaRK Benchmark. She describes how the Avatar Optimizer enhances LLM tool usage through contrastive reasoning and iterative feedback. The discussion also tackles the STaRK Benchmark's role in evaluating retrieval systems, highlighting challenges like unifying textual and relational data, multi-vector embeddings, and the future of human-centered language models in various applications.

Feb 12, 2025 • 58min
Contextual AI with Amanpreet Singh - Weaviate Podcast #114!
Amanpreet Singh, Co-Founder and CTO of Contextual AI, dives into the revolutionary world of Retrieval Augmented Generation (RAG) 2.0. He discusses the seamless integration of generative and retrieval models and the challenges of prompt engineering. Amanpreet emphasizes the necessity of continual learning and updates to model weights to resolve knowledge conflicts. The conversation also highlights the potential of reinforcement learning algorithms and the importance of domain-specific data. Buckle up for insights on the future of AI and specialized agents!

Jan 28, 2025 • 54min
Cartesia AI with Karan Goel - Weaviate Podcast #113!
Karan Goel, leading force at Cartesia AI, discusses groundbreaking advancements in text-to-speech technology and neural network architecture. He shares insights into State Space Models, designed to overcome traditional model limitations. The conversation dives into the evolution of long context processing and the importance of emotional intelligence in AI communications. Karan also highlights the significance of many-shot in-context learning and its applications in education, as well as the development of user-friendly on-device models.

Jan 15, 2025 • 58min
Google Vertex AI RAG Engine with Lewis Liu and Bob van Luijt - Weaviate Podcast #112!
Bob van Luijt, Co-founder of Weaviate, and Lewis Liu, Product Manager at Google Cloud, dive into the new Vertex AI RAG Engine. They discuss the evolution of knowledge representation, from semantic webs to modern AI applications. Bob shares insights on the challenges of re-indexing and embeddings, while Lewis highlights bottlenecks in data ingestion. The duo explores the potential of Generative Feedback Loops and Agentic Architectures in enhancing AI systems, as well as the complexities of integrating prompts and external tools. It's a fascinating discussion on the future of AI!

Jan 8, 2025 • 53min
Morningstar Intelligence Engine with Aravind Kesiraju - Weaviate Podcast #111!
Join Aravind Kesiraju, Principal Software Engineer at Morningstar, as he shares insights on the development of the Morningstar Intelligence Engine. He discusses the fascinating world of no-code/low-code AI applications and how to build advanced financial chatbots. Discover the intricacies of integrating diverse data sources and optimizing language models for financial tasks. Explore the evolution of Retrieval-Augmented Generation (RAG) data pipelines and the challenges of managing sensitive financial information while enhancing chatbot performance with intelligent question classification.


