

The Data Stack Show
Rudderstack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
Episodes
Mentioned books

Feb 2, 2022 • 57min
73: What a High Performing Data Team (and Stack) Looks Like with Paige Berry of Netlify
Highlights from this week’s conversation include:Paige’s career path (2:44)Paige’s role and responsibilities at Netlify (6:38)Sharing data insights (8:55)Scope in the context of delivering an insight (12:39)Defining “insight” (15:10)Where the client journey begins (16:43)Miscommunication because of vague terminology (20:06)Netlify’s internal knowledge repository (23:01)Breaking down Netlify’s hub and spoke model (30:45)What data tools to use and when (35:21)The metric layer and BI (44:17)Next steps in the data space (49:42)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Jan 28, 2022 • 4min
The PRQL: How High Performing Data Teams Put Tooling in the Background
This week on the PRQL, Eric and Kostas discuss tooling as they preview the upcoming show with Paige Berry of Netlify.

Jan 26, 2022 • 55min
72: Building Data Ops Into the Data Lifecycle with Douwe Maan of Meltano
Highlights from this week’s conversation include:Douwe’s career journey (3:04)The missing piece in GitLab’s data tooling (7:35)The open-source offering in the data space (12:38)Singer’s connection with Meltano (22:31)How Meltano manages connectors on a diverse codebase (35:21)The data house side of Meltano (39:47)Data house operating versus Airflow (44:06)Meltano’s vision present today (47:02)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Jan 21, 2022 • 6min
The PRQL: Is It Viable to Manage Integrations Open Source?
Eric and Kostas preview the upcoming show featuring Douwe Maan of Meltano.

Jan 19, 2022 • 57min
71: ETL at the Edges with Jimmy Chan of Dropbase
Highlights from this week’s conversation include:Jimmy’s career background (3:01)How to use Data cubes (5:52)What Dropbase is and who it is built for (11:01)Getting sales and marketing data in usable formats (16:46)Ensuring data remains flexible and transferable (28:36)Defining what “offline data” is and how to use it (34:09)How Dropbase can work with the rest of the data stack (43:30)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Jan 14, 2022 • 6min
The PRQL: Is Kostas an Excel Power User Yes/No?
Eric and Kostas preview the upcoming conversation with Jimmy Chan of Dropbase.

Jan 12, 2022 • 1h
70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi
Highlights from this week’s conversation include:Vinoth’s career background (3:19)Building a data lake at Uber (6:52)Defining what a data lake is (14:01)How data warehouses differ from data lakes (22:46)When you should utilize an open source solution in your datastack (37:36)Evolving from a data warehouse to a data lake (45:09)Early wins Hudi earned inside of Uber (52:30)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Jan 7, 2022 • 5min
The PRQL: What Old Tech Concepts Were Borrowed to Build the Data Lake House?
Eric and Kostas preview the upcoming show as they talk about data lakes and data warehouses and why these are important.

Jan 5, 2022 • 1h 4min
69: What is the Modern Data Stack?
Highlights from this week’s conversation include:Panel introductions and backgrounds (2:55)What the modern data stack means to each of our panelists (5:04)Defining the fundamental components of a modern data stack (17:22)How the modern stack drives insights and actions for businesses (28:03)Getting to a uniform definition to the modern stack (33:45)Managing the modernization of a large scale data stack (39:09)How testing works in the dbt context (48:44)The relationship between the data warehouse and the data lake (52:25)What has us most excited or the future of modern data stacks (56:02)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Dec 31, 2021 • 5min
The PRQL: Should Data Trust Drive the Evolution of Your Data Stack?
In this PRQL, Eric and Kostas preview their upcoming show where they discuss the modern data stack with some of the top experts in the industry.