The Real Python Podcast cover image

Natural Language Processing and How ML Models Understand Text

The Real Python Podcast

00:00

How to Do a Text Classification Project in Python?

In python, countvictorization and t f idea victorization can be used to normalize text. It's a way of comparing words that are twice as long or ten times as long. You don't really need to think that much about how to use these methods on large amounts of text. I actually did a project relatively recently where i built a hate speech classifier, and i got 80 % accuracy using just these methods for pre process the text.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app