3min snip

What's AI Podcast by Louis-François Bouchard cover image

Jerry Liu on the Future of AI: LlamaIndex, LLMs, RAG, Prompting and more ! What's AI Podcast Episode 25

What's AI Podcast by Louis-François Bouchard

NOTE

Fine-tuning Embeddings for Data Domain Adaptation

An existing embedding, generated by a black box like OpenAI ADAP, can be fine-tuned with a transform to better model specific data. This process involves adding an adapter model on top of the base model, which can be fine-tuned on the document side or the query side. Lomidex offers the capability to fine-tune and encourages users to explore this approach. Additionally, fine-tuning weaker models like llama2 to output structured data and distilling prompts and instructions from more powerful models like JupyT4 are core use cases. Regarding the data pipeline for a rag, the chunking strategy plays a crucial role, and sub-optimal strategies can lead to a failing pipeline. Factors to consider include the quality of the file parser and selecting the appropriate chunking size and strategy.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode