The Data Stack Show cover image

The Data Stack Show

Latest episodes

undefined
Sep 2, 2022 • 5min

The PRQL: Who Really Needs To Know How a DBMS Works?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Kyle Weller of Onehouse.ai.
undefined
Aug 31, 2022 • 49min

102: Building Pinot for Real-Time, Interactive User Analytics with Kishore Gopalakrishna of StarTree

Highlights from this week’s conversation include:Kishore’s background and career journey (2:30)Internal analytics versus user-facing analytics (3:49)New ways of thinking about analytics (8:06)What makes Pinot different (13:45)How Pinot transforms systems (21:53)Understanding the data landscape (32:40)The Pinot user experience (36:27)Something exciting about StarTree (40:05)When you should adopt this technology (43:15)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Aug 26, 2022 • 3min

The PRQL: Data Warehouses on Steroids

In this bonus episode, Eric and Kostas preview their upcoming conversation with Kishore Gopalakrishna of StarTree.
undefined
Aug 24, 2022 • 1h 4min

101: The Future of Machine Learning with Willen Pienaar of Tecton and Tristan Zajonc of Continual

Highlights from this week’s conversation include:When is it right to use ML? (5:22)ML business models (10:21)Significant changes in delivering ML (19:07)Why ML is different (25:19)SQL becoming more important (34:39)Graduating from SQL-based to real-time (37:22)Space for a new role (45:11)State-of-the-art models (49:03)The most exciting thing in the ML space (53:59)Open source in ML (56:39)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Aug 19, 2022 • 5min

The PRQL: Can Machine Learning Be Commoditized?

In this bonus episode, Eric and Kostas preview their upcoming live stream episode featuring Willem Pienaar of Tecton and Tristan Zajonc of Continual.
undefined
Aug 17, 2022 • 54min

100: Data Quality Is Relative to Purpose with James Campbell of Superconductive

Highlights from this week’s conversation include:James’ role at Great Expectations (2:33)What Great Expectations does (5:49)How Great Expectations approaches data quality (7:01)Why a data engineer should use Great Expectations (16:41)Defining “data quality” (19:16)Translating expectations from one domain to the other (27:00)Community around Great Expectations (30:59)The user experience (33:41)Something exciting on the horizon (40:27)Interacting with marketers in a non-technical way (43:57)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Aug 12, 2022 • 4min

The PRQL: What’s the Hardest Part About Data Quality?

Eric and Kostas preview their upcoming conversation with James Campbell at Superconductive.
undefined
5 snips
Aug 10, 2022 • 1h 13min

99: State of the Data Lakehouse with Vinoth Chandar of Apache Hudi

Highlights from this week’s conversation include:Vinoth’s background and career journey (3:08)Defining “data lakehouse” (5:10)Databricks versus lake houses (13:37)The services a lakehouse needs (17:37)How to communicate technical details (26:55)Onehouse’s product vision (31:41)Lakehouse performance versus BigQuery solutions (36:44)How to deliver customer experience equally (40:17)How to start building a lakehouse (44:00)Big tech’s effect on smaller lakehouses (55:33)Skipping the data warehouse (1:04:39)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Aug 5, 2022 • 5min

The PRQL: Does Lakehouse Architecture Really Mean the End of the Data Warehouse and Data Lake As We Know It?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Vinoth Chandar of Apache Hudi.
undefined
Aug 3, 2022 • 1h 2min

98: Category Theory and the Mathematical Foundation of the Technologies We Use with Eric Daimler of Conexus

Highlights from this week’s conversation include:Eric’s background and career journey (3:30)Presenting to people without knowledge of AI (11:04)Why math was chosen over AI (19:03)From compilers to databases (25:42)The contribution of category theory (30:09)The Connexus customer experience (37:45)The primary user of Connexus (46:33)Interacting with 300,000 databases (51:07)When Connexus begins to add value (54:02)The best way to learn this mathematical approach (55:46)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode