How AI Is Built  cover image

How AI Is Built

BM25 is the workhorse of search; vectors are its visionary cousin | S2 E14

Nov 15, 2024
David Tippett, a search engineer at GitHub with expertise in BM25 and OpenSearch, delves into the efficiency of BM25 versus vector search for information retrieval. He explains how BM25 refines search by factoring in user expectations and adapting to diverse queries. The conversation highlights the challenges of vector search at scale, particularly with GitHub's massive dataset. David emphasizes that understanding user intent is crucial for optimizing search results, as it surpasses merely chasing cutting-edge technology.
54:05

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • BM25 excels in efficiency and versatility for information retrieval, adapting well to varied query types without needing complex models.
  • Effective search optimization relies on understanding user intent and behavior, ensuring results align with diverse expectations and use cases.

Deep dives

Downsides of Vector Search

Vector search presents various challenges that need to be addressed for effective implementation. One major issue is its lack of robustness in handling different types of queries, as each may require different embedding models and vector indexes. Additionally, the computational and storage costs associated with keeping indexes in memory for low latency are significant. This inefficiency makes it less suitable for a diverse set of search queries compared to traditional methods.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode