The Data Stack Show cover image

The Data Stack Show

Latest episodes

undefined
Jan 14, 2022 • 6min

The PRQL: Is Kostas an Excel Power User Yes/No?

Eric and Kostas preview the upcoming conversation with Jimmy Chan of Dropbase.
undefined
Jan 12, 2022 • 1h

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

Highlights from this week’s conversation include:Vinoth’s career background (3:19)Building a data lake at Uber (6:52)Defining what a data lake is (14:01)How data warehouses differ from data lakes (22:46)When you should utilize an open source solution in your datastack (37:36)Evolving from a data warehouse to a data lake (45:09)Early wins Hudi earned inside of Uber (52:30)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Jan 7, 2022 • 5min

The PRQL: What Old Tech Concepts Were Borrowed to Build the Data Lake House?

Eric and Kostas preview the upcoming show as they talk about data lakes and data warehouses and why these are important.
undefined
Jan 5, 2022 • 1h 4min

69: What is the Modern Data Stack?

Highlights from this week’s conversation include:Panel introductions and backgrounds (2:55)What the modern data stack means to each of our panelists (5:04)Defining the fundamental components of a modern data stack (17:22)How the modern stack drives insights and actions for businesses (28:03)Getting to a uniform definition to the modern stack (33:45)Managing the modernization of a large scale data stack (39:09)How testing works in the dbt context (48:44)The relationship between the data warehouse and the data lake (52:25)What has us most excited or the future of modern data stacks (56:02)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Dec 31, 2021 • 5min

The PRQL: Should Data Trust Drive the Evolution of Your Data Stack?

In this PRQL, Eric and Kostas preview their upcoming show where they discuss the modern data stack with some of the top experts in the industry.
undefined
Dec 29, 2021 • 25min

68: Season Three Recap: Holiday Edition with Eric Dodds and Kostas Pardalis

In this episode, Eric and Kostas look back over the great topics and guests from season three of the Data Stack Show.  The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Dec 22, 2021 • 56min

67: Now is the Time to Think About Data Quality with Manu Bansal of Lightup Data

Highlights from this week’s conversation include:Manu’s career background and describing Lightup (2:31)Why traditional tools don’t work for modern data problems (6:04)How a data lake differs from a data warehouse (11:35)Defining data quality (14:07)The business impact of solving and applying data quality (31:36)Constructing a healthy financial view on the impact of data (41:09)How to work with unstructured data in a meaningful way (47:44)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Dec 21, 2021 • 4min

The PRQL: Will Data Quality Always Require a Human in the Loop?

Eric and Kostas preview the upcoming show by talking about data quality.
undefined
Dec 15, 2021 • 51min

66: How Data Infrastructure Has Evolved and Managing High Performing Data Teams with Srivatsan Sridharan

Highlights from this week’s conversation include:Starting his career on the first-ever data team at Yelp (2:00)How to approach the adoption of new technology (7:04)When to use stream processing vs. batching (11:35)What is a pipeline and why is it core to a data engineer? (14:07)Where a new data scientist should begin their career (19:14)The key factors impacting a new technology decision (27:09)Managing team emotions in decision making (34:25)The unique challenge of Fintech vs other consumer industries (45:03)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Dec 10, 2021 • 5min

The PRQL: How Would You Define a Data Pipeline? Featuring the RudderStack Eng. Team

On the PRQL this week, Eric and Kostas bring in some of the Rudderstack engineering team to discuss data pipelines and preview episode 66 of the Data Stack Show.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode