
The Data Stack Show 185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular
Apr 10, 2024
Ryan Blue, expert in data processing and metadata formats, discusses the evolution of data processing, challenges in transitioning to S3, impact of latency on query performance, designing a new metadata format, and the trade-offs in writing workloads. He also explores the vendor influence on access controls, restructuring data security, exciting releases and future plans, and the fundamental shift in data architecture.
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8
Introduction
00:00 • 4min
Evolution of Data Processing and Data Formats in Big Data
04:08 • 22min
Evolution of Iceberg and Automated Partitioning
26:12 • 15min
Exploring Atomicity in Data Writing and OLTP vs. Analytic Systems
41:35 • 4min
Exploring Data Processing Challenges and Write Amplification in Analytical Workloads
45:43 • 5min
Managing Access Control in Data Systems
50:37 • 27min
Advancements in Data Processing World
01:17:23 • 9min
Evolution of Data Practitioners and Centralization in Data Architecture
01:26:37 • 3min
