Weaviate Podcast

Weaviate

Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.

Episodes

Jul 13, 2023 • 1h 39min

Charles Frye on Full Stack Deep Learning - Weaviate Podcast #57!

Hey everyone! Thank you so much for watching the 57th Weaviate podcast with Charles Frye! Charles is an educator at Full Stack Deep Learning, one of the world's top courses on Deep Learning with lectures available on YouTube (link below)! This was one of the most thorough Weaviate podcasts published so far, covering all sorts of topics around the evolution of Deep Learning! In particular, we discussed the Retrieval-Augmented Generation stack with Vector Databases and Zero-Shot Large Language Models and how that compares to more conventional machine learning workflows and the MLOps stack! I really enjoyed chatting with Charles and am more than happy to answer any questions or discuss any ideas you have about the content in the podcast! Thank you so much for listening! Check out Full Stack Deep Learning! https://fullstackdeeplearning.com/ Full Stack Deep Learning on YouTube! https://www.youtube.com/@The_Full_Stack Chapters 0:00 Welcome Charles Frye! 0:52 Charles’ journey into Deep Learning 3:00 Weights & Biases and MLOps 5:30 Retrieval-Augmented Generation Stack 8:58 Data Engines and AI Products 13:50 Fine-Tuning 16:35 Information Retrieval Techniques 20:10 RAG as Tool Use and RETRO 23:33 Gorilla and Fine-Tuned Tool Use 27:36 Text-to-SQL Tool Use 30:46 Generative Data Augmentation 33:05 LLM generated queries for embeddings 38:04 Long-Tail and Data Imbalance 41:45 LoRA LLM Fine-Tuning 44:50 Eigenvectors and Disentanglement 50:00 LLM for Each User 55:00 Embedding Visualization and ML Observability 58:40 GPU Utilization 1:05:05 Discord Q&A Bot App 1:16:10 Data Schema Design 1:21:25 Graph and Vector Databases 1:28:35 Future Directions in AI

Jul 12, 2023 • 1h 3min

Etienne Dilocker on Weaviate 1.20 - Weaviate Podcast #56!

Chapters 0:00 Weaviate 1.20!!! 0:40 Multi-Tenancy 35:36 PQ Rescoring 47:20 Re-Ranking, AutoCut, Rank Fusion 58:58 Cloud Monitoring Metrics

Jul 5, 2023 • 1h 7min

Aleksa Gordcic - Weaviate Podcast #55!

Hey everyone! Thank you so much for watching the 55th episode of the Weaviate Podcast with Aleksa Gordcic! This episode dives into Aleksa's incredible story from Deep Learning YouTube to DeepMind and now creating Ortus! We dived into all sorts of topics, and I loved hearing about the latest updates on Ortus and how Aleksa sees the current state of AI development! We are more than happy to answer any questions or discuss any ideas you might have about the content in the podcast! Thanks so much for watching! Check out Ortus here! - https://www.ortusbuddy.ai/welcome Chapters 0:00 Introduction 1:08 Deep Learning YouTube 5:40 DeepMind 9:40 Ortus 19:50 LangChain and LlamaIndex 23:10 Software 2.0 and Full Stack DL 29:20 Training Embedding Models 32:23 Text Chunking for Vector DBs 34:35 Visual Information in YouTube 38:15 Simulating Conversations 42:46 Aidan Gomez Quote on Synthetic Data 44:40 Tree of Thoughts 47:40 New Ortus Features 49:00 Embedding Marketplace 54:00 Personal Organization

Jun 22, 2023 • 56min

Stephanie Horbaczewski and Gunjan Bhattarai on Vody - Weaviate Podcast #53!

Chapters 0:00 Introduction 0:38 Founding Story of Vody 8:15 Custom Embedding Models 12:42 Movie Genre Vectors 13:42 Classification and Contrastive Learning 15:45 Foundation Model Tuning 21:13 Multimodal Generative Models 25:08 Training Embedding Models 33:20 Tabular Data Ranking Models 36:00 RoomGPT 41:36 Diversity in Recommendations 48:25 Future Directions in Multimodal AI 51:15 Open-Source 55:45 Keeping up with Vody!

Jun 14, 2023 • 42min

Yana Welinder on Kraftful - Weaviate Podcast #52!

Hey everyone, thank you so much for watching the 52nd episode of the Weaviate Podcast with Yana Welinder! Yana is the Founder and CEO of Kraftful (https://www.kraftful.com/). Kraftful is an incredibly interesting "ChatGPT but for Product Research" -- curating specific skills for Product Managers into a collection of prompts. We discussed all sorts of things, from the latest innovations in LLMs to the ChatGPT marketplace and product management. I really hope you enjoy the podcast!

Jun 7, 2023 • 55min

Greg Kamradt and Colin Harmon on LLM Agents - Weaviate Podcast #51

Hey everyone, thank you so much for watching the 51st episode of the Weaviate Podcast with Greg Kamradt and Colin Harmon! Greg and Colin are both entrepreneurs in the space of new AI tools powered by LLMs! This podcast is about keeping up with the evolution of LLM Agents from AutoGPT to connecting LLMs with Vector Databases or Wolfram Alpha, as well as the ChatGPT Marketplace, Personalized LLMs, Private LLMs, and many more! I think there are so many interesting nuggets from this podcast, thank you so much to Greg and Colin for joining, really enjoyed this one! Data Independent: https://www.youtube.com/@DataIndependent Greg Kamradt on Twitter: https://twitter.com/GregKamradt Nesh: https://hellonesh.io/ Colin Harmon on LinkedIn: https://www.linkedin.com/in/coluha/ Colin Harmon Blog: https://colinharman.substack.com/ Colin Harmon at Haystack US 2023: https://www.youtube.com/watch?v=LO3U5iqnTpk Chapters 0:00 Introduction 0:42 Backgrounds 2:43 Defining “LLM Agents” 6:12 Data-Aware LLMs 13:04 Tool Use 13:38 ChatGPT API vs. Marketplace 17:40 Personalized LLMs, LLM for Greg 19:20 PrivateGPT 25:14 AutoGPT and Chain-of-Thought Prompting 32:30 Few-Shot Examples 35:30 Early AI Signals and Open-Source 43:10 Multi-Agent LLMs 47:14 Fine-Tuning and Long Input Lengths 52:20 Greg’s LLM Wishlist Hierarchy 53:15 Keeping up with Greg and Colin!

Jun 2, 2023 • 28min

Retrieving Texts based on Abstract Descriptions Explained!

This video explores a new paper on using summarization chains to represent long texts and using (original text, summary) pairs to optimize text embedding models! Here are 3 main takeaways I think everyone working with Weaviate may get value from: 1. Understanding of Summary Indexing and the Prompts (as well as Prompt Chains) used to build them. 2. Continued development of LLM-generated data for search -- creating (full text, summary) pairs gives you (1) data to build a summary index with as mentioned, (2) data to compare different embedding models with, and (3) data to train your own embedding model. 3. Tournament style evaluation with human annotators -- the top 5 retrieved texts from one model are concatenated with the top 5 from another model, and these 10 are given to human annotators to pick 5. This is how the authors report the performance of their models, rather than with traditional benchmarks. This may be a more productive evaluation technique for most real world search applications. Thank you so much for watching, here are some links mentioned in the video! Retrieving Texts based on Abstract Descriptions: https://arxiv.org/abs/2305.12517 Weaviate Blog - Combining LangChain and Weaviate: https://weaviate.io/blog/combining-langchain-and-weaviate Weaviate Blog - Generative Feedback Loops: https://weaviate.io/blog/generative-feedback-loops-with-llms Jerry Liu in Llama Index Blog - A New Document Summary Index for LLM-powered QA Systems: https://medium.com/llamaindex-blog/a-new-document-summary-index-for-llm-powered-qa-systems-9a32ece2f9ec Learning to Retrieve Passages without Supervision (Spider): https://arxiv.org/pdf/2112.07708.pdf Weaviate Blog - Analysis of Spider - https://weaviate.io/blog/research-insights-spider Chapters 0:00 Introduction 0:13 Quick Overview 7:30 How to use in Weaviate! 7:50 Background 12:08 Motivation 14:20 Prompts Used 18:14 More Details of training 21:12 Human Evaluation Study 22:40 My Takeaways from the Paper
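For anyone who wants to try the tournament style evaluation from takeaway 3 on their own search results, here is a minimal sketch of the bookkeeping. It is not the paper's code: the function names, the document IDs, and the choice to credit both models when the same text appears in both top-5 lists are assumptions made for illustration.

```python
from collections import Counter


def build_pool(top_a, top_b):
    """Merge two top-k lists into one pool, remembering which model(s) retrieved each text."""
    sources = {}
    for model, docs in (("A", top_a), ("B", top_b)):
        for doc in docs:
            sources.setdefault(doc, set()).add(model)
    return sources


def tally_picks(sources, picks):
    """Count how many of the annotator's picks were retrieved by each model."""
    counts = Counter()
    for doc in picks:
        for model in sources[doc]:
            counts[model] += 1
    return counts


# Hypothetical top-5 results from two embedding models (IDs are made up).
top_a = ["a1", "a2", "a3", "a4", "a5"]
top_b = ["b1", "b2", "a3", "b4", "b5"]

sources = build_pool(top_a, top_b)

# Hypothetical annotator choice: the 5 texts judged most relevant out of the pool.
picks = ["a1", "a3", "b1", "b2", "b5"]
counts = tally_picks(sources, picks)
```

In a real study you would likely shuffle the pool before showing it to annotators so they cannot tell which model retrieved which text, then aggregate the counts over many queries.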

May 31, 2023 • 36min

Kapa AI with Emil Sorensen and Finn Bauer - Weaviate Podcast #50!

Hey everyone, thank you so much for watching the 50th (!!!) Weaviate Podcast with Emil Sorensen and Finn Bauer from Kapa AI! Are you curious about taking either your, or your company's, specific information and putting it into a Vector DB + LLM system? Emil and Finn are doing this at the highest level, taking the documentation of software companies like Weaviate and building these LLM-augmented assistant systems for them. This podcast takes a complete tour from Data Ingestion to Cleaning, Chunking, LLM latency, and emerging trends in LLMs such as cheap fine-tuning with LoRA or Long Context Windows such as GPT-4 32K, MPT-7B 65K, or Anthropic Claude's 100k. I learned so much from speaking with Emil and Finn! Please let us know any questions you have or ideas you would like to discuss! Check out Kapa here! https://www.kapa.ai/ Chapters 0:00 Welcome Emil and Finn! 0:42 Origin Story of Kapa 2:08 Data Ingestion 5:10 Data Cleaning 6:20 Slack / Discord / Forum Ingestion 9:05 Testing Models on Support QA 11:14 Selling Kapa to Weaviate and friends 12:37 Hallucinations in LLMs 14:06 Trends in Open-Source LLMs 15:20 Long Input LLMs (32K, 65K, 100K, …) 16:54 Retrieval-Augmentation for Long Input LLMs 18:08 Fine-Tuning LLMs 23:00 As much or as refined content as possible? 24:40 Adding Docs from Integrations 26:15 Generative Feedback Loops 29:00 What in AI excites you the most?

May 25, 2023 • 1h 30min

Neurosymbolic AI in Search with Professor Laura Dietz - Weaviate Podcast #49!

Professor Laura Dietz discusses Neurosymbolic Search, Entity Linking, Entity Re-Ranking, Knowledge Graphs, and Large Language Models. The episode explores the potential for bias when using LLMs for relevance judgments and the complexities of merging neural technologies with symbolic systems in search technology. The conversation delves into enhancing search algorithms, filtered vector search, entity linking with context-specific models, and the nuances of relevance judgments in research papers.

May 23, 2023 • 43min

Unstructured with Brian Raymond - Weaviate Podcast #48!

Hey everyone, thank you so much for watching the 48th episode of the Weaviate Podcast!! This is a SUPER exciting one, welcoming Brian Raymond, the CEO / Founder of Unstructured! Unstructured is a perfect complementary technology for Weaviate, helping people get their Unstructured data into Weaviate! The podcast dives into the nuances of this task, but it generally revolves around Unstructured's abstraction of Partitioning, Cleaning, and Staging! Unstructured is making groundbreaking innovations on using Visual Document Layout models for Partitioning, for example saying that this part of the PDF is the header, body, image caption, and so on. Cleaning then describes removing pesky details like extra whitespace or odd characters. Staging then describes transformations such as formatting a text chunk with its metadata into the JSON for a Weaviate object upload! I really hope you find this podcast interesting! We are publishing a blog post as well showing an example of how to use Unstructured to get PDF data into Weaviate, please please check that out and let us know if it works for your data and how we can improve it! This blog post can be found on weaviate.io and we will be managing discussions around it both in the Weaviate slack, as well as Unstructured! Thank you so much for listening! Check out Unstructured here! https://www.unstructured.io/ Chapters 0:00 Welcome Brian!! 0:27 What is Unstructured? 5:42 Why now? New Advancements in Unstructured 8:02 Thoughts on Data Connectors Hub 10:55 PDFs to Weaviate with Unstructured 13:53 State-of-the-Art in OCR and Document Parsing 16:10 How to get the data from Weaviate.io? 18:06 Foundation Models from Unstructured 20:45 Evaporate-Code+ 23:15 CSV, Parquet, JSON transformations in Staging 25:08 Cleaning Bricks 28:02 Visual Document Examples 30:45 Text Chunking with Metadata 33:25 Knowledge Graphs with Goldman Sachs example 39:10 LLM Hallucinations 42:10 Announcements from Brian!
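To make the Cleaning and Staging ideas from this episode a bit more concrete, here is a rough sketch in plain Python. It deliberately does not call the Unstructured library. The helper names, the `DocumentChunk` class name, and the property names are all hypothetical, and the `class` / `properties` layout is an assumption about the object JSON you would upload, so check it against your own Weaviate schema.

```python
import json
import re


def clean_text(text):
    """Cleaning: drop non-printable characters and collapse runs of whitespace."""
    text = "".join(ch for ch in text if ch.isprintable() or ch.isspace())
    return re.sub(r"\s+", " ", text).strip()


def stage_chunk(text, metadata, class_name="DocumentChunk"):
    """Staging: combine a cleaned text chunk with its metadata into one object dict."""
    return {
        "class": class_name,  # hypothetical class name
        "properties": {"content": clean_text(text), **metadata},
    }


# A made-up chunk with messy whitespace and a stray control character.
raw = "  Vector   search\n\tin Weaviate\x00  "
obj = stage_chunk(raw, {"source": "example.pdf", "element_type": "body"})

# Serialize to the JSON that would be sent in an upload request.
payload = json.dumps(obj)
```

Keeping the partitioning label (here `element_type`) next to the text is what later lets you filter a search to, say, only body text or only image captions.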
