Data Brew Season 1 Episode 3: Demystifying Delta Lake

Dec 6, 2020

In this episode, Michael Armbrust, the creator of Spark SQL, discusses how Delta Lake was conceived and how it has evolved, how to query Delta tables efficiently and troubleshoot slow queries, how to optimize performance and query speed, how partitioning and Z-Order work, and features for data ingestion and schema handling in Delta Lake.

Chapters

Introduction

00:00 • 2min

Conception and Evolution of Delta Lake

02:15 • 12min

Efficient querying and troubleshooting slow queries on Delta tables

14:40 • 2min

Optimizing Performance and Query Speed in Delta Lake

16:21 • 6min

Understanding Partitioning and Z-Order in Delta Lake

21:59 • 2min

Exciting Features for Data Ingestion and Schema Handling in Delta Lake

24:04 • 2min