
[MINI] Natural Language Processing
Data Skeptic
00:00
Detecting Similarities in Text and Basic Techniques in Natural Language Processing
This chapter explores how word frequency counts can be used to detect similarities between books, using authors like Isaac Asimov and Arthur C. Clark as examples. It also introduces basic techniques in Natural Language Processing (NLP) such as tokenization, stemming, N-grams, and part of speech (POS) tagging, while touching on the challenges in computer understanding of language. The chapter concludes with a mention of an upcoming interview related to NLP.
Transcript
Play full episode