

#110 - Data Quality - The Hard Parts w/ Jeremy Stanley (Anomalo)
19 snips Jan 23, 2023
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Introduction
00:00 • 2min
Data Quality Is One of Your Biggest Project Killers
02:01 • 3min
Is There a Certain Checklist for Visualization?
04:33 • 2min
How Much Time Do You Give Yourself to Get Up to Speed?
06:47 • 3min
Big Data Machine Learning - A Fresh Perspective on Problems
10:00 • 3min
Data Quality - What's the Definition?
12:33 • 5min
Preventing Alert Fatigue by Providing Visualizations and Summary Statistics Right Off With an Alert
17:09 • 2min
The Bystander Effect of Data Quality Issues
18:45 • 5min
The Slow Erosion of Data Quality
23:53 • 2min
Data Quality
25:32 • 4min
Machine Learning Models Need to Be Tuned to Deal With Data Quality Issues
29:34 • 2min
Boosted Decision Trees - Is That a Good Place to Start?
32:02 • 3min
Detecting Anomalies With Machine Learning
34:48 • 5min
Instacart: Machine Learning Fully Automated Approach
39:20 • 3min
Large Language Models and Data Quality
42:05 • 2min
The Future of Machine Learning in the Context of Text
43:59 • 5min
Using Structured Data in a Data Warehouse Is a Good Idea
49:26 • 3min
Is the Deep Learning Model Conscious?
52:16 • 2min
Structured Data and Data Quality Issues
54:11 • 3min
Jeremy Stan and Anamolo. Awesome.
56:53 • 2min