

Episode 503: Diarmuid McDonnell on Web Scraping
Mar 16, 2022
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21
Introduction
00:00 • 2min
Web Scraping vs Screen Scraping
01:51 • 2min
The Use Cases for Web Scraping
04:12 • 4min
Web Scraping
07:53 • 2min
The Challenges of Web Scraping
10:20 • 3min
The Challenges of Web Scraping
13:38 • 3min
Web Scraping - Data Quality Challenges
16:17 • 2min
Web Scraper Reliability
18:07 • 3min
Is There an Active Community for Web Scraping?
21:02 • 2min
Web Scraping - How Much Has It Been for You?
23:24 • 2min
How to Scrape a Non Profit Sector?
25:19 • 2min
How to Scrape Non Profits
27:07 • 3min
Web Scraping
29:49 • 2min
Python Versus Java
31:47 • 2min
Web Scraping - Should I Use Ore?
33:22 • 3min
Using Machine Learning and Natural Language Processing on Web Scraping
36:47 • 2min
How Much Data Do You Need?
38:52 • 2min
Web Scraping
41:09 • 4min
Web Scraping as a Social Scientist
45:07 • 2min
Learning Computatial Methods
46:59 • 2min
Learning From Other Academics
48:29 • 3min