Neural Search Talks — Zeta Alpha cover image

ColBERT + ColBERTv2: late interaction at a reasonable inference cost

Neural Search Talks — Zeta Alpha

00:00

Efficient Indexing through Centroids and Delta Vectors

Improving indexing efficiency involves using centroids and delta vectors to cluster term embeddings. By storing term embeddings as the ID of the nearest centroid plus a quantized delta vector, storage costs are lowered significantly. The process reduces the search footprint of the index while maintaining an approximation of the original term embeddings.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner