Data Engineering Podcast

Building A Knowledge Graph From Public Data At Enigma With Chris Groskopf - Episode 50

Oct 1, 2018
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
Enigma - What's the Story?
01:44 • 2min
3
Enigma - Knowledge Grafts
04:13 • 2min
4
Enigma and Taxonomy
06:18 • 2min
5
The Value of Knowledge Graphs
08:33 • 3min
6
The Data Platform and the Data Platform Architecture?
11:24 • 5min
7
The Ontology Driven Validation of Data Sources
16:52 • 2min
8
Enigma and Date Engineering - What's the Hardest Challenge?
19:06 • 5min
9
Is There a Problem With Unit Testing for ETL Processes?
24:17 • 4min
10
Regeneration of the Knowledge Graft on Demand
28:43 • 2min
11
Scaling Out the Number of Data Sets
30:22 • 2min
12
Using the Knowledge Graph to Traverse the Attributes of a Data Set
32:50 • 2min
13
Do You Have a Production Environment or a Preproduction Environment?
34:28 • 3min
14
Is There a Difference Between Staging in Production and What's in Production?
37:02 • 2min
15
Scaling a Scaled Workflow?
38:36 • 2min
16
Are There Any Major Decisions You Would Make Differently?
40:23 • 2min
17
Typical Use Cases That You've Seen So Far
42:08 • 4min
18
Enigma - What's Next for Enigma?
45:49 • 2min
19
Enigma - What's the Model for Writing Data Processing Code?
47:55 • 2min
20
What's the Biggest Gap in the Tooling for Data Management?
50:23 • 2min