RAG Is A Hack - with Jerry Liu from LlamaIndex

Latent Space: The AI Engineer Podcast

NOTE

Optimizing retrieval and bias in embedding models

Retrieval has many parameters to tune, including the chunking algorithm, the metadata definition, the embedding model, and the retrieval method. OpenAI's embedding model is the common default, while Sentence Transformers models are popular because they are open source. Fine-tuning the embedding model can improve retrieval quality, but it changes the vectors the model produces, so previously indexed documents may need to be re-embedded and re-indexed. An alternative is to train a transform that is applied only on the query side, which leaves the existing document index untouched. The gains may not be large, but it is worth trying. OpenAI also provides a cookbook example on adding a bias to embeddings.
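
As a rough illustration of the query-side idea, the sketch below fits a linear map that moves query embeddings toward their paired document embeddings while the document vectors stay fixed. It is a minimal sketch on synthetic data, not the method described in the episode: the ridge-regression fit, the function names `fit_query_transform` and `top1`, and the simulated rotation between query and document space are all assumptions made for illustration.

```python
import numpy as np


def fit_query_transform(query_embs, doc_embs, ridge=1e-3):
    """Fit W so that query_embs @ W approximates the paired doc_embs.

    Closed-form ridge regression; a hypothetical stand-in for a learned adapter.
    """
    dim = query_embs.shape[1]
    gram = query_embs.T @ query_embs + ridge * np.eye(dim)
    return np.linalg.solve(gram, query_embs.T @ doc_embs)


def top1(query_embs, doc_embs):
    """Return the index of the most cosine-similar document for each query."""
    q = query_embs / np.linalg.norm(query_embs, axis=1, keepdims=True)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    return np.argmax(q @ d.T, axis=1)


# Synthetic setup (assumption): queries live in a rotated copy of document space.
rng = np.random.default_rng(0)
n_docs, dim, n_train = 200, 16, 150
doc_embs = rng.normal(size=(n_docs, dim))
rotation, _ = np.linalg.qr(rng.normal(size=(dim, dim)))
query_embs = doc_embs @ rotation + 0.05 * rng.normal(size=(n_docs, dim))

# Fit on the first 150 (query, document) pairs; document embeddings are never modified.
W = fit_query_transform(query_embs[:n_train], doc_embs[:n_train])

# Evaluate held-out queries against the full, unchanged document index.
held_out = query_embs[n_train:]
expected = np.arange(n_train, n_docs)
baseline_acc = np.mean(top1(held_out, doc_embs) == expected)
adapted_acc = np.mean(top1(held_out @ W, doc_embs) == expected)
print(f"baseline top-1: {baseline_acc:.2f}, with query transform: {adapted_acc:.2f}")
```

Because only `W` is learned and it is applied at query time, the stored document vectors never need to be recomputed, which is the practical appeal over fine-tuning the embedding model itself. With real embeddings the improvement would likely be much smaller than on this toy data.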
