
Weaviate Podcast
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.
Latest episodes

29 snips
Oct 17, 2024 • 57min
Compound AI Systems with Philip Kiely - Weaviate Podcast #105!
Philip Kiely, the leading developer relations at Baseten, shares insights on compound AI systems and their evolution. He discusses breaking tasks into multiple stages for better AI model performance. The conversation covers advancements in multimodal AI and strategies for deploying these systems efficiently. Kiely emphasizes the benefits of smaller models and constrained generation techniques, alongside architectural tips for Kubernetes deployment. Key comparisons are made between various model serving frameworks, focusing on optimizing AI performance while minimizing costs.

7 snips
Sep 18, 2024 • 1h 1min
AI Agents That Matter with Sayash Kapoor and Benedikt Stroebl - Weaviate Podcast #104!
Sayash Kapoor and Benedikt Stroebl, co-first authors from Princeton Language and Intelligence, discuss their influential paper on AI agents. They explore the crucial balance between performance and cost in AI systems, emphasizing that amazing responses mean little if they are too expensive to produce. The duo introduces the DSPY framework to optimize accuracy and costs and debates the adapting challenges of AI benchmarks in dynamic environments. They also highlight the importance of human feedback in enhancing AI reliability and performance.

11 snips
Aug 28, 2024 • 1h 1min
MIPRO and DSPy with Krista Opsahl-Ong! - Weaviate Podcast #103
Krista Opsahl-Ong, a leading developer and scientist at Stanford University, is the mastermind behind MIPRO and DSPy. In this engaging discussion, she illuminates the world of Automated Prompt Engineering and its impact on language models. Krista dives into the challenges of manual prompt construction, highlighting innovative algorithms that streamline the process. The conversation also explores the intricacies of structured outputs, multi-stage language programs, and the revolutionary potential of Large Language Models in AI-driven solutions.

Aug 14, 2024 • 53min
AI-Native Development with Guy Podjarny and Bob van Luijt - Weaviate Podcast #102!
Guy Podjarny, co-founder of the successful cybersecurity firm Snyk, joins Bob van Luijt to dive into the world of AI-native development. They explore the critical differences between AI-native and AI-enabled applications, revolutionizing software development practices. Highlights include the shift from a code-centric approach to one focusing on user specifications, and how AI assists in the creative coding process. The duo also discusses the balance of trust and change in adopting AI, and the future of personalized media through innovative platforms like Tessle!

Jul 17, 2024 • 48min
Scaling Pandas with Devin Petersohn - Weaviate Podcast #101!
Devin Petersohn, creator of Modin and co-founder of Ponder, acquired by Snowflake, discusses optimizing pandas data frames, building Moden to match pandas API, query optimization in large language model systems, and handling CSV files efficiently in distributed systems on the Weaviate Podcast.

5 snips
Jul 4, 2024 • 44min
Generative UIs with Lucas Negritto and Bob van Luijt!
Former OpenAI member Lucas Negritto and Weaviate Co-founder Bob van Luijt discuss AI-native applications in a fascinating podcast. They explore building UIs within generative models, native multimodality, subjective feedback, and more. Topics include prototype creation, challenges of using natural language prompts, creativity in software writing, and the evolution of AI in game development.

7 snips
Jun 25, 2024 • 54min
ACORN with Liana Patel and Abdel Rodriguez - Weaviate Podcast #99!
Liana Patel is a Ph.D. student at Stanford University who is the lead author of ACORN, a breakthrough in Approximate Nearest Neighbor Search with Filters! Also joining the podcast is Abdel Rodriguez, a Vector Index Researcher and Engineer at Weaviate. This podcast dives into all sorts of details behind ACORN. Starting with how Liana developed her interest in Approximate Nearest Neighbor Search algorithms and then transitioning into how ACORN differs from previous approaches, the Two-Hop Neighborhood Heuristic, Predicate Subgraphs, Experimental Details, and many more topics! Major thank you to Liana and Abdel for joining the podcast, this was such a fun conversation packed with insights about Proximity Graph algorithms for Vector Search with Filtering!

Jun 19, 2024 • 59min
Window Search Tree with Josh Engels - Weaviate Podcast #98!
Josh Engels is a Ph.D. student at MIT who has published several works advancing the state of the art in Vector Search. Josh has recently developed the Window Search Tree, a new algorithm particularly targeted for improving Filtered Vector Search. Even more particularly than that, the WST algorithm targets Filtered Search with continuous-valued filters such as "price" or "date", also known as range filters. This is a huge application for Vector Databases and it was incredible getting to pick Josh's brain on how this works and the state of Approximate Nearest Neighbor Search!

18 snips
Jun 11, 2024 • 60min
The Future of Search with Nils Reimers and Erika Cardenas - Weaviate Podcast #97!
Join the 97th Weaviate Podcast with Nils Reimers and Erika Cardenas as they delve into AI-powered search technology. Topics include Compass embeddings, multi-aspect chunking, ColBERT, RAG Evaluation, and more. Discover the challenges and solutions in embeddings, LLMs, smart chunking strategies, evaluating chatbots, and optimizing vector databases with quantization methods.

Jun 5, 2024 • 1h 35min
Deep Learning with Letitia Parcalabescu - Weaviate Podcast #96!
Letitia Parcalabescu, a Ph.D. student at the University of Heidelberg and creator of the YouTube channel AI Coffee Break, shares insights on AI and content creation. She discusses the evolution of deep learning, emphasizing the shift from traditional models to transformers. The conversation dives into innovative techniques in algorithm optimization and the complexities of large language models. Letitia also highlights the new Mamba architecture and the future of multimodal AI, advocating for a balance between AI capabilities and human oversight.