

The Data Stack Show
RudderStack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
Episodes

Apr 15, 2024 • 42min
Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData
Highlights from this week’s conversation include:
Ryan’s background in data (0:58)
Transition from Performing Arts to Data (2:23)
Understanding End Users in Data Projects (6:08)
Learning from Failures in Data Projects (8:07)
The self-service era (19:50)
Struggles of self-service (21:23)
The disillusion with dashboards (26:23)
GoodData's approach (30:06)
Merging wisdom with modern approach (31:50)
User experience with GoodData (34:05)
Defining metrics and AI (36:35)
Connecting with Ryan and GoodData (39:26)
Final thoughts and takeaways (41:06)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Apr 10, 2024 • 1h 30min
185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular
Ryan Blue, expert in data processing and metadata formats, discusses the evolution of data processing, challenges in transitioning to S3, impact of latency on query performance, designing a new metadata format, and the trade-offs in writing workloads. He also explores the vendor influence on access controls, restructuring data security, exciting releases and future plans, and the fundamental shift in data architecture.

Apr 8, 2024 • 5min
The PRQL: The Two Parallel Tracks of Development In Data Processing with Ryan Blue of Tabular

Apr 3, 2024 • 58min
184: Kafka Streams and Operationalizing Event Driven Applications with Apurva Mehta of Responsive
Highlights from this week’s conversation include:
Apurva’s background in streaming technology (0:48)
Developer experience and Kafka Streams (2:47)
Motivation to bootstrap a startup (4:09)
Meeting the Confluent founders and early work at Confluent (6:59)
Projects at Confluent and transition to engineering management (10:34)
Overview of Responsive and event-driven applications (12:55)
Defining event-driven applications (15:33)
Importance of latency and state in event-driven applications (18:54)
Low Latency and Stateful Processing (21:52)
In-Memory Storage and Evolution of Kafka (25:02)
Motivation for KSQL and Kafka Streams (29:46)
Category Creation and Database-like Interface (34:33)
Developer Experience with Kafka and Kafka Streams (38:50)
Kafka Streams Functionality and Operational Challenges (41:44)
Metrics and Tuning Configurations (43:33)
Architecture and Decoupling in Kafka Streams (45:39)
State Storage and Transition from RocksDB (47:48)
Future of Event-Driven Architectures (56:30)
Final thoughts and takeaways (57:36)

Apr 1, 2024 • 4min
The PRQL: Event-Driven Applications: Where Low Latency Meets High Impact with Apurva Mehta of Responsive

Mar 27, 2024 • 1h 3min
183: Why Modern Data Quality Must Move Beyond Traditional Data Management Practices with Chad Sanderson of Gable.ai
Data expert Chad Sanderson discusses modern data quality and management practices on this podcast. Topics include challenges with the modern data stack, rethinking data catalogs, AI impact on data, incentivizing engineers for data quality, and the role of AI in data semantics. The conversation also touches on data as a product, quantifying the cost of data changes, and the importance of slowing down to go faster in data management.

Mar 25, 2024 • 8min
The PRQL: The Data Supply Chain with Chad Sanderson of Gable.ai
Chad Sanderson, founder of Gable.ai, talks about managing data upstream, collaboration among engineering teams, and comparing data pipelines to supply chains using McDonald's as an example for optimization and efficiency.

Mar 20, 2024 • 1h 1min
182: Building a Dynamic Data Infrastructure at Enterprise Scale Featuring Kevin Liu of Stripe
Kevin Liu from Stripe discusses evolving data infrastructure, speech recognition work at Amazon, metadata analysis surprises, product sizing, data pipelining, and the future of open source projects in data infrastructure.

Mar 18, 2024 • 6min
The PRQL: Exploring the Intersection of Software Engineering and Data Management with Kevin Liu of Stripe

Mar 13, 2024 • 60min
181: OLAP Engines and the Next Generation of Business Intelligence with Mike Driscoll of Rill Data
Mike Driscoll, co-founder of Rill Data, discusses the evolution of Druid, architectural decisions, user and developer experiences, BI tools, data architecture, and AI's impact on BI. He also shares humorous dreams outside of data.