Weaviate Podcast cover image

Weaviate Podcast

Latest episodes

undefined
Nov 7, 2023 • 57min

Simba Khadder on FeatureForm - Weaviate Podcast #74!

Hey everyone! Thank you so much for watching the 74th Weaviate Podcast feature Simba Khadder, the CEO and Co-Founder of FeatureForm! To begin, "features" broadly describe the inputs to machine learning models that they use to produce outputs, or predictions. Feature stores orchestrate the construction of features, whether that be transformations for tabular machine learning models such as XGBoost, to chunking for vector embedding inference, and now features for LLM inference in RAG. Right out of the gate, Simba really opened my eyes to the role that feature engineering plays in RAG. Further touching on this at the very end under the "Exciting future for RAG with Features" chapter, Simba further describes how we can use more advanced features to provide better context to LLMs. In addition to these insights on RAG, there are so many nuggets in the podcast, Simba is a world class professional when it comes to building distributed systems, production scale recommendation systems, and more! I learned so much from chatting with Simba, I hope you enjoy listening to the podcast! As always we are more than happy to answer any questions or discuss any ideas you have about the content in the podcast! FeatureForm: https://www.featureform.com/ Highly Recommend!! Simba Khadder at the CMU DB Seminar series: https://www.youtube.com/watch?v=ZsWa6XiBc-U FeatureForm and Weaviate demo! https://docs.featureform.com/providers/weaviate Chapters 0:00 Simba Khadder 0:35 RAG and Feature Stores 4:30 Experience building Recommendation Systems 9:47 The End-to-End Feature Lifecycle 15:08 Virtual Feature Store Orchestration 26:45 RAG Evaluation 31:27 Feature Engineering 34:15 LLM Tuning and Features 39:55 Streaming Features 51:15 Data Drift Detection 54:20 Exciting future for RAG with Features
undefined
6 snips
Nov 6, 2023 • 52min

Charles Packer on MemGPT - Weaviate Podcast #73!

Charles Packer, lead author of MemGPT at UC Berkeley, discusses the concept of explicit memory management in GPT models, the use of prompts to handle memory limitations, interrupts in retrieval augmented generation (RAG), achieving ideal running speed in high parameter models, fine-tuning MemGBT for long conversations, search actions pagination, role-playing language models, and the future integration of memory in chatbot platforms.
undefined
Nov 1, 2023 • 50min

Madelon Hulsebos on Tabular Machine Learning - Weaviate Podcast #72!

Hey everyone! Thank you so much for watching the 72nd episode of the Weaviate Podcast with Madelon Hulsebos!! Madelon is one of the world's experts on Machine Learning with Tables and Tabular-Structured Data, this was such an eye-opening conversation! We discussed all sorts of topics from the relationship of tabular data and embeddings, to searching through tables, semantic joins, more complex Text-to-SQL, using machine learning for query execution, using tabular data in search and recommendation reranking, and many more! This was easily one of the most knowledge packed episodes of the Weaviate podcast so far, please don't hesitate to leave any questions or ideas you have related to the content discussed! You can learn more about Madelon's incredible research career and publications / talks here: https://www.madelonhulsebos.com/! Papers such as GitTables are listed here! Another nice nugget form the podcast - Madelon introduced me to the BIRD-SQL benchmark which really expanded my understanding of Text-to-SQL (https://arxiv.org/pdf/2305.03111.pdf. Chapters 0:00 Welcome Madelon! 0:58 Tabular Data and Embeddings 3:10 Tabular Representation Learning 5:48 Semantic Type Detection 9:50 Pandas as an LLM Tool 11:52 Table-Based Question Answering and Text-to-SQL 19:35 Joins with Machine Learning 21:38 Query Execution with Machine Learning 22:45 Graph Neural Networks 24:07 XGBoost 28:28 Merging Tables 32:10 Fact Representation 35:50 GPT-4V and Tables 39:00 Metadata in Embeddings 42:45 Table Retrieval in Weaviate 46:25 Exciting future directions!!
undefined
Oct 26, 2023 • 56min

Vibs Abhishek on Alltius AI - Weaviate Podcast #71!

Hey everyone! Thank you so much for watching the 71st Weaviate Podcast with Vibs Abhishek! Vibs is the CEO and Founder of Alltius AI, as well as a professor at UC Irvine business school! In order to tame the somewhat chaotic emerging landscape of RAG and LLM applications, Alltius has settled on 3 core pillars of Knowledge, Skills, and Deployment Channels! Vibs further explained how he sees the distinction between Assistants and Agents and many more topics important to Enterprise deployment of RAG applications such as reducing hallucinations and employing classifiers to route skills and knowledge sources! I learned so much from this conversation, I hope you enjoy the podcast! Alltius KNO Plus Demo Video: https://www.loom.com/share/fcfe516b75ea4f069b1a8d6a3510fa4c?sid=5f43317f-c20b-4dd9-91d3-2cde993fd91f Chapters 0:00 Welcome Vibs 0:22 Background 2:30 Alltius’ UI for Assistants 7:15 The Knowledge Pillar 12:05 SQL Router and Intent Management 14:10 Classifying a Pipeline / Skill 17:30 Flexibility of Zero-Shot versus Fine-Tuning 21:00 The Channels Pillar 23:00 Connecting the Warehouse / Lakehouse 24:50 Assistant versus Agent 28:30 MemGPT 31:25 Offline LLM Research 35:50 Multi-Agent Role-Playing Assistants 39:25 From Clicks to Conversations 44:10 CEO / Professor and Evolution of the Field
undefined
Oct 24, 2023 • 31min

MemGPT Explained!

Discover the innovative world of MemGPT, where operating system principles meet large language models. Explore how memory management is revolutionized to enhance conversational AI. Delve into the architecture that boosts dialogue consistency and engagement. Unpack the challenges of training long-context models and the role of efficient memory in search dynamics. Learn about the creation of synthetic textbooks as training data, showcasing the seamless interaction of language models and APIs.
undefined
Oct 18, 2023 • 55min

Kevin Cohen on Neum AI - Weaviate Podcast #70!

Hey everyone! Thank you so much for watching the 70th episode of the Weaviate podcast with Neum AI CTO and Co-Founder Kevin Cohen! I first met Kevin when he was debugging an issue with his distributed node utilization and have since learned so much from him about how he sees the space of Data Ingestion, also commonly referenced as ETL for LLMs! There are so many interesting parts to this from the general flow of data connectors, chunkers and metadata extractors, embedding inference, and the last leg of the mile of importing the vectors to a Vector DB such as Weaviate! I really loved how Kevin broke down the distributed messaging queue and system design for orchestrating data ingestion at massive scale such as dealing with failures and optimizing the infrastructure as code setup. We also discussed things like new use cases with quadrillion scale vector indexes and the role of knowledge graphs in all this! I really hope you enjoy the podcast, please check out this amazing article below from Neum AI! https://medium.com/@neum_ai/retrieval-augmented-generation-at-scale-building-a-distributed-system-for-synchronizing-and-eaa29162521 Chapters 0:00 Check this out! 1:18 Welcome Kevin! 1:58 Founding Neum AI 6:55 Data Ingestion, End-to-End Overview 9:10 Chunking and Metadata Extraction 14:20 Embedding Cache 16:57 Distributed Messaging Queues 22:15 Embeddings Cache ELI5 25:30 Customizing Weaviate Kubernetes 38:10 Multi-Tenancy and Resource Allocation 39:20 Billion-Scale Vector Search 45:05 Knowledge Graphs 52:10 Y Combinator Experience
undefined
Oct 4, 2023 • 1h 9min

Charles Pierse on Tactic Generate - Weaviate Podcast #69!

Hey everyone! Thank you so much for watching the 69th episode of the Weaviate Podcast featuring Charles Pierse from Tactic! Tactic has recently launched their new Tactic Generate project, an incredible UI for conducting research across multiple documents. I think there is a massive opportunity to pair these prompts and LLM workflows with User Interfaces and take more of a holistic User Experience perspective. Tactic Generate has done an incredible job of that, please take a look from the link below! I had such a fun conversation catching up with Charles (Charles was our 2nd Weaviate Podcast guest!), I hope you enjoy the podcast! Tactic Generate: https://tactic.fyi/generative-insights/ Chapters 0:00 Tactic Generate 1:40 Welcome Charles! 2:38 Charles’ work at Tactic 4:40 LLMs comparing documents 9:10 LLM Chaining 17:30 Discovering LLM Chains 20:28 Moats in ML Products 28:48 Fine-Tuning vs. RAG 34:30 Fine-Tuning Search Models 39:45 Skepticism on RLHF 41:52 Gorilla, Integrations, and CRM 45:40 Query Routers 47:55 CRM and Tree-of-Thoughts 55:54 Graph Embeddings 1:02:20 Llama CPP / GGML 1:04:28 What are you looking forward to most in AI?
undefined
Sep 20, 2023 • 52min

Weights and Biases on Fine-Tuning LLMs - Weaviate Podcast #68!

Hey everyone! Thank you so much for watching the 68th episode of the Weaviate Podcast! We are super excited to welcome Morgan McGuire, Darek Kleczek, and Thomas Capelle! This was such a fun discussion beginning with generally how see the space of fine-tuning from why you would want to do it, to the available tooling, intersection with RAG and more! Check out W&B Prompts! https://wandb.ai/site/prompts Check out the W&B Tiny Llama Report! https://wandb.ai/capecape/llamac/reports/Training-Tiny-Llamas-for-Fun-and-Science--Vmlldzo1MDM2MDg0 Chapters 0:00 Tiny Llamas! 1:53 Welcome! 2:22 LLM Fine-Tuning 5:25 Tooling for Fine-Tuning 7:55 Why Fine-Tune? 9:55 RAG vs. Fine-Tuning 12:25 Knowledge Distillation 14:40 Gorilla LLMs 18:25 Open-Source LLMs 22:48 Jonathan Frankle on W&B 23:45 Data Quality for LLM Training 25:55 W&B for Data Versioning 27:25 Curriculum Learning 29:28 GPU Rich and Data Quality 30:30 Vector DBs and Data Quality 32:50 Tuning Training with Weights & Biases 35:47 Training Reports 42:28 HF Collections and W&B Sweeps 44:50 Exciting Directions for AI
undefined
Sep 13, 2023 • 1h 1min

Farshad Farahbakhshian and Etienne Dilocker on Weaviate and AWS - Weaviate Podcast #67!

Hey everyone! Thank you so much for watching the 67th Weaviate Podcast, announcing Weaviate on the AWS Marketplace! This was one of my favorite podcasts to date with a deep dive on the details of running RAG applications in the cloud, our general understanding of LLM Fine-Tuning and RAG, as well as a really interesting discussion on VPCs and Hybrid SaaS! I hope you find the podcast useful, as always we are more than happy to answer any questions or discuss any ideas you have about the content presented in the podcast! Learn more here: https://aws.amazon.com/marketplace/seller-profile?id=seller-jxgfug62rvpxs As well as here: https://weaviate.io/developers/weaviate/installation/aws-marketplace Chapters 0:00 Welcome Farshad 0:38 Weaviate’s Journey to AWS 2:05 Retrieval-Augmented Generation and Vector DBs 3:44 Running AI in the Cloud 9:40 Fine-Tuning LLMs vs. RAG 10:30 Skill vs. Knowledge (Lawyer Example) 14:28 Continual Learning of LLMs 16:50 Searching through multiple sources 19:58 Hybrid Search controlled by LLMs 22:10 Classes versus Filters 25:00 SQL and Vector Search 25:55 Favorite RAG Use Cases 31:55 Cloud Benchmarking 37:00 Price Performance 38:20 Tuning HNSW 42:15 Horizontal Scalability on AWS Marketplace 47:00 Privacy Requirements 54:45 Weaviate Hybrid SaaS 59:00 AWS Marketplace
undefined
Sep 12, 2023 • 4min

Hybrid SaaS in Weaviate Explained!

Hey everyone! Here is a clip from our newest Weaviate podcast with Farshad Farahbakhshian, Gen AI specialist at AWS and Etienne Dilocker, CTO and Co-Founder of Weaviate! This podcast announces Weaviate on the AWS marketplace and is packed with info on running Weaviate in the cloud such as this clip explaining how Hybrid SaaS works! I hope you find the clip useful, we are more than happy to answer any questions you have about the content in this clip! Chapters 0:00 Quick Intro for Context 0:29 Etienne Dilocker on Hybrid SaaS

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app