3min chapter

Learning from Machine Learning cover image

Nils Reimers: Sentence Transformers, Search, Future of NLP | Learning from Machine Learning #3

Learning from Machine Learning

CHAPTER

Train Sentence Encoder

The original sentence transformers model was just trained like small dataset from NLI. So that's what the first version of sentence transformers was trained on and sadly evaluated on. But now it was trained on like a whole bunch of text, like really ugly, noisy social media text full of hashtags and emojis. Now the model understands like emojis and hashtags and knows, okay, what's the similarity between hashtags and relationship in hashtags. Right. And this gives you a much, much better model,. sadly, sometimes if people use it on the old benchmarks on the nicely cleanly written text, doesn't perform as well as models overfitted on these settings.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode