Machine Learning Guide cover image

MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK

Machine Learning Guide

00:00

Psychi Learn T F Ida F Vectorizer

Gensem's l da algorithm looks at the distribution of key words as they occur in themes across documents. This concept is called topic modelling, and it goes like this: You take your documents, the we call this a corp s,. bunch of text, you pull out the key words. Now at this time in our time line, you are going to be using n l t k to pull out the Key Words. N l t k tokenizes, removes stop words, lematizes your tokens. And then now you have your corpus converted into key words, key words. Then we take the psychi learn t f i d f vectorizer tool, t f ida f vector

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app