The Real Python Podcast cover image

Natural Language Processing and How ML Models Understand Text

The Real Python Podcast

00:00

How to Solve the Cat and Cat Problems in a Second

If we're literally taking words, raw words, and we're ing every single one into its own column, you're going to have problems. We can do tricks to solve for the grammatical differences. One is called stemming, and another is called lematisation, which is a ridiculous wordye a. But both of these are reproaches where you're kind of trying o reduce words that mean the same thing. Soour sort of like filtering in em sort of setting a scale of saying, this should be within, you know, this many times mentioned exactly, exactly.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app