Data Skeptic cover image

NLP in 2019

Data Skeptic

00:00

Authorship Attribution

In the supervised setting, what you are trying to learn is a clust alberism with is learned perameters that could help you characterize writing style of a particular author. Different authors use punctuation differently, for exampleand so that's an easy way to figure out who wrote charles dickens because he usually puts a comma in between his noun and verb as fairly common back in those days. The use of punctuation to denote pauses is supposed to nowadays, it's lot more rule based.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app