Feature Processing for Text Analytics
Linear Digressions
00:00
The Simplest Way to Represent Words in Vectors
Every single word that you see is one of the words that you might have with here, with no duplicates. So there has to be some kind of mapping if you want to go back and forth between words and numbers. And then probably most of the spots in that vector ar zerois because most words don't occur in most documents. Got it?
Play episode from 03:28
Transcript


