Towards Data Science

124. Alex Watson - Synthetic data could change everything

19 snips
May 18, 2022
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
What Is Synthetic Data?
02:29 • 2min
3
Is There a Synthetic Data Technique?
04:41 • 4min
4
The Biggest Blocker for Data Scientists
09:11 • 2min
5
Is There a Strategy for Insuring Synthetic Data Doesn't Leak?
10:56 • 4min
6
Synthesizing Data - What's the Nuts and Bolts of Synthesis?
14:46 • 3min
7
How to Train a Machine Learning Model From Random Weights With a Minimum Loss on Accuracy?
17:26 • 3min
8
What's the Nature of the Trade Off Between Performance and Privacy?
20:26 • 3min
9
How Do You Get Rid of Outliers?
23:36 • 2min
10
What Is Differential Privacy?
25:18 • 2min
11
Differential Privacy Is a Hammer That You Really Want to Use
27:40 • 2min
12
Differential Private Training - A Very Broad Tool for Scale Application
29:32 • 3min
13
Is There a Consistent Pattern in the Failure Modes?
32:10 • 2min
14
Data Science - The Long Astat Score
34:19 • 2min
15
Synthetic Data Analysis
36:33 • 4min
16
Data Exploration - Can You Just Call This by Itself?
40:30 • 2min
17
The Challenges of Synthetic Data?
42:13 • 2min
18
Can We Synthesize M G Noms From Mice?
44:19 • 3min
19
I Love the Generalizability of Large Language Models
47:05 • 2min
20
Are We Going to a World Where Machine Learning Isn't Just Data Preparation?
49:03 • 3min