
RAG Is A Hack - with Jerry Liu from LlamaIndex

Latent Space: The AI Engineer Podcast


Optimizing embedding models for performance and efficiency

It should be relatively free for every developer to run a fine-tuning process over their own data to improve performance.

In a production-grade data pipeline, optimizing the embedding model may require re-indexing every document.

A possible solution is to keep the document embeddings frozen and train a transform on the query instead.

Trying different parameters for that transform can help optimize retrieval by adding a bias to the embeddings.

The text exists in a latent space.
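The frozen-document idea can be illustrated with a small sketch. The episode summary does not say how the query transform is trained, so everything here is an assumption made for illustration: the transform is a single linear map fit by ridge regression, the data is synthetic, and the names `fit_query_transform` and `top1` are hypothetical helpers rather than part of LlamaIndex or any other library.

```python
import numpy as np

def fit_query_transform(Q, D, reg=1e-2):
    """Fit W minimizing ||Q @ W - D||^2 + reg * ||W - I||^2.

    Q: (n, dim) query embeddings; D: (n, dim) embeddings of the matching
    documents. The documents are never modified, only the query side is.
    """
    dim = Q.shape[1]
    eye = np.eye(dim)
    # Closed-form ridge solution, regularized toward the identity map.
    return np.linalg.solve(Q.T @ Q + reg * eye, Q.T @ D + reg * eye)

def top1(queries, docs):
    """Index of the most cosine-similar document for each query."""
    qn = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    dn = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    return np.argmax(qn @ dn.T, axis=1)

# Synthetic stand-in for real embeddings (assumption): queries live in a
# rotated, slightly noisy copy of the document space.
rng = np.random.default_rng(0)
dim, n = 16, 200
docs = rng.normal(size=(n, dim))  # frozen document index
rotation, _ = np.linalg.qr(rng.normal(size=(dim, dim)))
queries = docs @ rotation + 0.05 * rng.normal(size=(n, dim))

# Train on the first 150 (query, document) pairs, evaluate on the rest.
W = fit_query_transform(queries[:150], docs[:150])
held = slice(150, None)
target = np.arange(n)[held]
acc_before = np.mean(top1(queries[held], docs) == target)
acc_after = np.mean(top1(queries[held] @ W, docs) == target)
print(f"top-1 accuracy before: {acc_before:.2f}, after: {acc_after:.2f}")
```

Because only `W` changes, the stored document vectors stay valid and nothing has to be re-indexed; at query time the only extra cost is one matrix multiplication per query.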

Play episode from 40:50
