Weaviate Podcast cover image

Weaviate Podcast

Latest episodes

undefined
11 snips
Feb 19, 2025 • 1h

Optimizing Retrieval Agents with Shirley Wu - Weaviate Podcast #115!

Shirley Wu, a PhD student at Stanford University, delves into AI agents and retrieval systems, bringing expertise from her work on the Avatar Optimizer and STaRK Benchmark. She describes how the Avatar Optimizer enhances LLM tool usage through contrastive reasoning and iterative feedback. The discussion also tackles the STaRK Benchmark's role in evaluating retrieval systems, highlighting challenges like unifying textual and relational data, multi-vector embeddings, and the future of human-centered language models in various applications.
undefined
Feb 12, 2025 • 58min

Contextual AI with Amanpreet Singh - Weaviate Podcast #114!

Amanpreet Singh, Co-Founder and CTO of Contextual AI, dives into the revolutionary world of Retrieval Augmented Generation (RAG) 2.0. He discusses the seamless integration of generative and retrieval models and the challenges of prompt engineering. Amanpreet emphasizes the necessity of continual learning and updates to model weights to resolve knowledge conflicts. The conversation also highlights the potential of reinforcement learning algorithms and the importance of domain-specific data. Buckle up for insights on the future of AI and specialized agents!
undefined
Jan 28, 2025 • 54min

Cartesia AI with Karan Goel - Weaviate Podcast #113!

Hey everyone! Thank you so much for watching the 113th episode of the Weaviate Podcast with Karan Goel from Cartesia AI! Cartesia AI is leading the AI world in text-to-speech models! As exciting as these new applications in speech generation are, Cartesia is also building around an incredibly exciting new neural network architecture that cuts across all of AI -- State Space Models. State Space Models (SSMs) present a new approach to modeling long sequences circumventing the quadratic attention bottlenecks of transformers. In the podcast, we discuss Karan's perspectives around end-to-end modeling, long context and Multimodal processing, building and deploying a new kind of model, and more! I hope you find the podcast interesting and useful! As always more than happy to discuss these ideas further with you! Thanks for listening!
undefined
14 snips
Jan 15, 2025 • 58min

Google Vertex AI RAG Engine with Lewis Liu and Bob van Luijt - Weaviate Podcast #112!

Bob van Luijt, Co-founder of Weaviate, and Lewis Liu, Product Manager at Google Cloud, dive into the new Vertex AI RAG Engine. They discuss the evolution of knowledge representation, from semantic webs to modern AI applications. Bob shares insights on the challenges of re-indexing and embeddings, while Lewis highlights bottlenecks in data ingestion. The duo explore the potential of Generative Feedback Loops and Agentic Architectures in enhancing AI systems, as well as the complexities of integrating prompts and external tools. It's a fascinating discussion on the future of AI!
undefined
Jan 8, 2025 • 53min

Morningstar Intelligence Engine with Aravind Kesiraju - Weaviate Podcast #111!

Join Aravind Kesiraju, Principal Software Engineer at Morningstar, as he shares insights on the development of the Morningstar Intelligence Engine. They discuss the fascinating world of no-code/low-code AI applications and how to build advanced financial chatbots. Discover the intricacies of integrating diverse data sources and optimizing language models for financial tasks. Explore the evolution of Retrieval-Augmented Generation (RAG) data pipelines and the challenges of managing sensitive financial information while enhancing chatbot performance with intelligent question classification.
undefined
8 snips
Dec 18, 2024 • 1h 34min

Arctic Embed with Luke Merrick, Puxuan Yu, and Charles Pierse - Weaviate Podcast #110!

Join Luke Merrick from Snowflake, a key player in Arctic Embed development, and Charles Pierse, head of Weaviate Labs, as they dive into the intricacies of multilingual text embeddings. They explore the evolution of Arctic Embed 2.0, emphasizing its open-source nature. The conversation covers technical strategies in model training, the economics of pre-training large models, and the challenges of integrating negative examples. They discuss the delicate balance between model simplicity and nuance in retrieval, promoting collaboration to enhance search quality.
undefined
12 snips
Nov 13, 2024 • 34min

Agentic RAG with Erika Cardenas - Weaviate Podcast #109!

In this engaging discussion, Erika Cardenas, Technology Partner Manager at Weaviate, dives deep into the innovative world of Agentic RAG systems. She explains how Agentic RAG outperforms traditional approaches by enhancing complex querying and reasoning. The conversation explores the importance of memory in AI, the evolution of multi-agent systems, and the role of generative feedback loops in advancing AI capabilities. Erika also emphasizes the necessity of human oversight in AI, underscoring collaborative approaches between machines and human input.
undefined
Nov 7, 2024 • 40min

Let Me Speak Freely? with Zhi Rui Tam - Weaviate Podcast #108!

Zhi Rui Tam, an expert in large language models and the lead author of "Let Me Speak Freely?" dives into the impact of JSON structured outputs on AI performance. He discusses innovative prompting techniques to enhance model generation and explores the trend of ensemble inference strategies. Tam contrasts open-source models with black box APIs, emphasizing the importance of privacy. The conversation also touches on the significance of structured programming outputs and future implications for efficient AI planning.
undefined
Oct 30, 2024 • 58min

SWE-bench with John Yang and Carlos E. Jimenez - Weaviate Podcast #107!

In a fascinating discussion, John Yang from Stanford and Carlos E. Jimenez from Princeton, co-first authors of the SWE-bench papers, delve into the revolutionary SWE-bench project. They explore how AI enhances software engineering, addressing the challenges of integrating language models for coding tasks. The duo discusses resource allocation for software engineering agents in Docker and Kubernetes, and the future of AI in business, including potential advancements in virtual reality. Their insights reveal how AI can reshape the development landscape.
undefined
Oct 22, 2024 • 51min

AI in Education with Rose E. Wang - Weaviate Podcast #106!

Rose E. Wang, a Ph.D. student at Stanford University, dives into her groundbreaking research on AI in education, particularly through the Tutor CoPilot project. She discusses one of the largest randomized control trials in this field, involving 900 students and 1800 tutors. Rose highlights the innovative blend of human expertise and AI, revealing how tools like Cursor enhance real-time tutoring experiences. She also addresses challenges in traditional education, the evolving role of AI, and the vital need for effective human-AI interactions in learning environments.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode