Data Engineering Podcast

An Exploration Of Tobias' Experience In Building A Data Lakehouse From Scratch

Dec 26, 2022
Chapters
1
Introduction
00:00 • 2min
2
Data Contracts - What's Data Mesh?
01:42 • 2min
3
Building Your Data Platform
03:21 • 5min
4
Building Out Your Data Platform
08:17 • 6min
5
Building a Self-Serve Platform - What's the Roadmap?
14:08 • 4min
6
Are You Creating a Data Shrek?
17:54 • 2min
7
How Do You Keep Your Focus on What Matters?
19:48 • 3min
8
AWS Glue and Dagster as the Orchestration Engine
23:08 • 3min
9
Are You Still Reacting to the Schema Change?
26:07 • 3min
10
Build vs Buy, You Know?
28:39 • 3min
11
What's the Roadmap for Open Source?
32:09 • 3min
12
Airbyte
34:58 • 3min
13
The Orchestration Layer Is the Source of Truth
38:23 • 2min
14
The Total Cost of Ownership
40:37 • 2min
15
Do You Have a Place to Start?
42:30 • 3min
16
Data Engineering Podcast - Was There Anything Surprising About Monte Carlo?
45:15 • 3min
17
Is There a Primary Place for X Metadata?
48:39 • 4min
18
Is There Anything You Don't Know About Data Modeling?
52:59 • 3min
19
The Limitation of Technical Debt
55:43 • 2min
20
The dbt Model of Landed in Raw, Nobody Ever Touches Raw for Anything
57:46 • 2min
21
What Are You Building?
01:00:03 • 3min
22
Do You Have Any Advice for Vendors?
01:02:55 • 3min
23
Are You Good Enough to Be on the Show?
01:05:36 • 3min
24
Is There a Missing Link in Data Platform and Analytics?
01:08:09 • 4min