Data Skeptic cover image

[MINI] Natural Language Processing

Data Skeptic

00:00

Detecting Similarities in Text and Basic Techniques in Natural Language Processing

This chapter explores how word frequency counts can be used to detect similarities between books, using authors like Isaac Asimov and Arthur C. Clark as examples. It also introduces basic techniques in Natural Language Processing (NLP) such as tokenization, stemming, N-grams, and part of speech (POS) tagging, while touching on the challenges in computer understanding of language. The chapter concludes with a mention of an upcoming interview related to NLP.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app