Weaviate Podcast

Semantic Query Engines with Matthew Russo - Weaviate Podcast #131!

14 snips
Nov 18, 2025
Matthew Russo, a Ph.D. student at MIT, dives into the world of semantic query processing engines and their potential to revolutionize database systems. He discusses the emergence of semantic operators like AI_WHERE and their role in transforming how we handle unstructured data. With insights on optimizing query planning and the benefits of filtering order, Matthew also introduces SemBench, a crucial standardized benchmark for evaluating semantic queries. Expect a lively exploration of the future of AI in databases and practical optimization strategies!
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

LLMs Turn SQL Into Semantic Queries

  • Foundation models enable on-the-fly semantic operators like filters and joins over images, audio, and text.
  • These operators let SQL-like queries ask natural-language predicates that models evaluate directly.
INSIGHT

Cost And Quality Dominate Semantic Queries

  • Semantic operators bring cost and variable quality trade-offs because each invocation may use expensive models.
  • Optimizing for LLM cost and accuracy becomes the central challenge for semantic query engines.
INSIGHT

Operator Types Unlock Different Optimizations

  • Separating semantic operators (map vs filter vs classify) enables different optimization strategies.
  • Filters support proxy/oracle cascades, while maps need different approaches because outputs are non-binary.
Get the Snipd Podcast app to discover more snips from this episode
Get the app