The Data Stack Show cover image

The Data Stack Show

Latest episodes

undefined
Nov 5, 2021 • 6min

Data Debrief: The Highs and Lows of Open Source Projects

Eric and Kostas break down further topics from episode 60 about stream processing and open source projects.
undefined
Nov 3, 2021 • 1h 7min

60: Architecting a Boring Stream Processing Tool With Ashley Jeffs of Benthos

Highlights from this week’s conversation include:A brief overview of Ashley’s background (2:47)Benthos’ creation and the problems it was meant to address (4:01)Use cases for Benthos (18:25)Key features of Benthos that make it stand out (22:23)Adding windowing to Benthos for fun (29:23)The highs and lows of maintaining an open source project for five years (32:17)The architecture of Benthos (36:23)The importance of ordering in streaming processing (42:15)Gaining traction with an open source project (53:21)Benthos’ blobfish mascot (58:03) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Oct 29, 2021 • 14min

Data Debrief: What Open Source Data Projects Have Come Out of Facebook, Whoops, *Meta?

On this week's debrief, Kostas and Eric talk about the variety of open source projects that come from Facebook.
undefined
Oct 27, 2021 • 58min

59: Making ETL Optional with Justin Borgman of Starburst Data

Highlights from this week’s conversation include:Starburst Data is Justin’s second startup (2:42)Starburst focuses on doing data warehousing analytics without the need for the data warehouse (4:14)Multi-cloud solutions among merger and acquisition use cases (8:32)Ways the stack is increasing in complexity (12:25)Comparing essential components of a data stack from 2010 to now (15:01)The future of ETL (27:36)The best maturity stage for an organization to implement Starburst (31:27)Starburst connectors (36:55)Monetizing enterprise solutions while promoting open source ones (41:52)The history of Presto and Trino (45:37)Benefits of a decentralized data mesh (49:53)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Oct 22, 2021 • 8min

Data Debrief: Will Enterprise Build The Future of Data Tooling?

On this week's Data Debrief, Eric and Kostas dig more into the topic of data tooling.
undefined
Oct 20, 2021 • 50min

58: Data Federation is No Longer The "F" Word with Scott Gnau of InterSystems

Highlights from this week’s conversation include:Solving problems with data has been a long-time passion of Scott’s (2:52)Day-to-day use of data at InterSystems (6:25)The technical aspects involved in constructing a data fabric (17:52)Companies at a variety of maturity levels can adopt a data fabric (26:49) A paradigm shift in the marketplace (28:39)Comparing and contrasting data fabric and data mesh (30:49)Sharing data across the business and not having it siloed in different departments (39:46)Privacy and security within a data fabric (41:22)The future of data fabric and pushing the edge (43:17)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Oct 15, 2021 • 7min

Data Debrief: Can Tools Help Solve Data Quality Organizational Challenges?

On this Data Debrief, Eric and Kostas are joined by Brian from Rudderstack to talk about Data Quality.
undefined
Oct 13, 2021 • 56min

57: Improving Data Quality Using Data Product SLAs with Egor Gryaznov of Bigeye

Highlights from this week’s conversation include:Egor’s software engineering background and history with Uber (2:19)Experimentation platforms and analytics definitions (7:49)Bigeye’s function and use cases (9:40)Managing the relationship between the data engineer maintaining the pipelines and the downstream teams providing the context (18:49)Pinpointing problems in data compared to problems in software (21:55)Defining data quality at Bigeye (24:13)Machine learning models as a data product (28:38)Determining SLAs (32:22)How Bigeye brings different parties together and addresses natural communication barriers (36:42)Looking at when an organization needs to implement data quality tooling (45:54)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Oct 6, 2021 • 1h 4min

56: Stream Processing and Observability with Jeff Chao of Stripe

Highlights from this week’s conversation include:Jeff’s history with stream processing (2:52)Working with Mantis to address the impact of Netflix downtime (4:20)Defining observability as operational insight (6:58)Time series data and the value of data today (18:52)Data integration’s shift from batch to streaming (29:34)The current state of change data capture (32:20)How an engineer thinks of the end-user (56:21)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
undefined
Sep 29, 2021 • 1h 7min

55: Tables vs. Streams and Defining Real-Time with Pete Goddard of Deephaven Data Labs

Highlights from this week’s conversation include:Pete’s background in data engineering and capital market trading (2:10)Comparison of the tooling from 2012 when Deephaven started with that of today (10:30)Taking a closer look at defining real-time data (19:47)Getting non-technical people, clients, and developers all on the same platform (36:11)Deephaven’s incremental update model (40:25)Kafka, timely data flow, and Deephaven (44:22)Use cases for Deephaven (51:52)Going to GitHub to try out Deephaven (1:02:43)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode