4min snip


The Future of Search with Nils Reimers and Erika Cardenas - Weaviate Podcast #97!

Weaviate Podcast

NOTE

Mind the Distribution Shift: Impact on Model Performance

Retrieval models such as embedding models and ColBERT can be highly sensitive to distribution shift. Many people generate their evaluation datasets and their training data with the same approach, which inflates the measured performance gains; those gains may not hold up on real traffic. Companies promoting this approach may be overlooking how sensitive these models are to changes in the query profile. For example, language models trained on clean data without spelling or grammar mistakes can perform poorly on real user queries, which are typically riddled with errors. This mismatch between training data and real queries can leave the resulting models performing worse than their base versions.
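One way to make the point concrete is to score the same retriever on clean queries and on noisy copies of them. The sketch below is only an illustration under stated assumptions: the documents, queries, `add_typos`, and `token_overlap_search` are all hypothetical, and the token-overlap retriever is a stand-in for a real embedding model, not anything described in the episode.

```python
import random

def add_typos(text: str, rate: float, rng: random.Random) -> str:
    """Swap adjacent letters with probability `rate` to mimic user typos."""
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        if chars[i].isalpha() and chars[i + 1].isalpha() and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip past the swapped pair
        else:
            i += 1
    return "".join(chars)

def token_overlap_search(query: str, docs: list[str]) -> int:
    """Toy retriever (assumption): return the index of the document sharing the most tokens."""
    q_tokens = set(query.lower().split())
    scores = [len(q_tokens & set(d.lower().split())) for d in docs]
    return max(range(len(docs)), key=scores.__getitem__)

def accuracy(queries: list[str], gold: list[int], docs: list[str]) -> float:
    """Fraction of queries whose top result is the expected document."""
    hits = sum(token_overlap_search(q, docs) == g for q, g in zip(queries, gold))
    return hits / len(queries)

# Hypothetical toy corpus and clean queries.
docs = [
    "reset your account password",
    "track the shipping status of an order",
    "cancel a subscription and request a refund",
]
clean_queries = [
    "how to reset my password",
    "where is my order shipping",
    "cancel subscription refund",
]
gold = [0, 1, 2]

# Simulate the shifted query profile: the same intents, but with typos.
rng = random.Random(0)
noisy_queries = [add_typos(q, rate=0.3, rng=rng) for q in clean_queries]

clean_acc = accuracy(clean_queries, gold, docs)
noisy_acc = accuracy(noisy_queries, gold, docs)
print(f"clean accuracy: {clean_acc:.2f}")
print(f"noisy accuracy: {noisy_acc:.2f}")
```

Reporting both numbers side by side, rather than only the clean score, is what exposes the gap. With a real model, the noisy set would ideally come from logged user queries rather than synthetic typos.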

