Machine Learning Guide cover image

MLG 019 Natural Language Processing 2

Machine Learning Guide

00:00

What Is a Bag of Words Model?

A bag of words model takes your document, cuts it up into tokens or bigrams and generates a ector. The vector size is going to be the number of terms in the dictionary. And if the word is present in this document, in this e male, in this web page, then we put a one in that column. Is called a sparse vector because there are very few ones, the rest are mostly zeros. If google had this system implemented as their primary search engine, people could add to the end of any web page any key words they want to show up for as highly relevant for search quew.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app