How AI Is Built  cover image

#22 Nils Reimers on the Limits of Embeddings, Out-of-Domain Data, Long Context, Finetuning (and How We're Fixing It) | Search

How AI Is Built

NOTE

Maximize Performance by Combining Techniques

Using traditional machine learning techniques like XGBoost, support vector machines, and logistic regression on top of frozen embeddings can yield surprisingly strong performance for classification tasks. This approach requires less training time and often shows greater robustness, especially in scenarios involving out-of-domain or cross-lingual classification. By leveraging these techniques, one can efficiently adapt to multiple languages using a foundation of training data in a single language. Additionally, combining various embedding models generates diverse features that enhance classification effectiveness, demonstrating the potential of simpler models when applied judiciously in conjunction with embeddings.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode