Latent Space: The AI Engineer Podcast cover image

Long Live Context Engineering - with Jeff Huber of Chroma

Latent Space: The AI Engineer Podcast

00:00

Context Chunk Selection

  • To select the most relevant context chunks, narrow down the candidate chunks from thousands to hundreds using first-stage retrieval methods like vector search and metadata filtering.
  • Then, use an LLM as a re-ranker to brute force from 300 chunks down to 30, which is more cost-effective than many realize.
Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app