The Data Stack Show

RudderStack
Apr 15, 2024 • 42min

Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData

Highlights from this week’s conversation include:
Ryan’s background in data (0:58)
Transition from Performing Arts to Data (2:23)
Understanding End Users in Data Projects (6:08)
Learning from Failures in Data Projects (8:07)
The self-service era (19:50)
Struggles of self-service (21:23)
The disillusion with dashboards (26:23)
GoodData's approach (30:06)
Merging wisdom with modern approach (31:50)
User experience with GoodData (34:05)
Defining metrics and AI (36:35)
Connecting with Ryan and GoodData (39:26)
Final thoughts and takeaways (41:06)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
Apr 10, 2024 • 1h 30min

185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular

Ryan Blue, expert in data processing and metadata formats, discusses the evolution of data processing, challenges in transitioning to S3, impact of latency on query performance, designing a new metadata format, and the trade-offs in writing workloads. He also explores the vendor influence on access controls, restructuring data security, exciting releases and future plans, and the fundamental shift in data architecture.
Apr 8, 2024 • 5min

The PRQL: The Two Parallel Tracks of Development In Data Processing with Ryan Blue of Tabular

Apr 3, 2024 • 58min

184: Kafka Streams and Operationalizing Event Driven Applications with Apurva Mehta of Responsive

Highlights from this week’s conversation include:
Apurva’s background in streaming technology (0:48)
Developer experience and Kafka Streams (2:47)
Motivation to bootstrap a startup (4:09)
Meeting the Confluent founders and early work at Confluent (6:59)
Projects at Confluent and transition to engineering management (10:34)
Overview of Responsive and event-driven applications (12:55)
Defining event-driven applications (15:33)
Importance of latency and state in event-driven applications (18:54)
Low Latency and Stateful Processing (21:52)
In-Memory Storage and Evolution of Kafka (25:02)
Motivation for KSQL and Kafka Streams (29:46)
Category Creation and Database-like Interface (34:33)
Developer Experience with Kafka and Kafka Streams (38:50)
Kafka Streams Functionality and Operational Challenges (41:44)
Metrics and Tuning Configurations (43:33)
Architecture and Decoupling in Kafka Streams (45:39)
State Storage and Transition from RocksDB (47:48)
Future of Event-Driven Architectures (56:30)
Final thoughts and takeaways (57:36)
Apr 1, 2024 • 4min

The PRQL: Event-Driven Applications: Where Low Latency Meets High Impact with Apurva Mehta of Responsive

Mar 27, 2024 • 1h 3min

183: Why Modern Data Quality Must Move Beyond Traditional Data Management Practices with Chad Sanderson of Gable.ai

Data expert Chad Sanderson discusses modern data quality and management practices on this podcast. Topics include challenges with the modern data stack, rethinking data catalogs, AI impact on data, incentivizing engineers for data quality, and the role of AI in data semantics. The conversation also touches on data as a product, quantifying the cost of data changes, and the importance of slowing down to go faster in data management.
Mar 25, 2024 • 8min

The PRQL: The Data Supply Chain with Chad Sanderson of Gable.ai

Chad Sanderson, founder of Gable.ai, talks about managing data upstream, collaboration among engineering teams, and comparing data pipelines to supply chains using McDonald's as an example for optimization and efficiency.
Mar 20, 2024 • 1h 1min

182: Building a Dynamic Data Infrastructure at Enterprise Scale Featuring Kevin Liu of Stripe

Kevin Liu from Stripe discusses evolving data infrastructure, speech recognition work at Amazon, metadata analysis surprises, product sizing, data pipelining, and the future of open source projects in data infrastructure.
Mar 18, 2024 • 6min

The PRQL: Exploring the Intersection of Software Engineering and Data Management with Kevin Liu of Stripe

Mar 13, 2024 • 60min

181: OLAP Engines and the Next Generation of Business Intelligence with Mike Driscoll of Rill Data

Mike Driscoll, co-founder of Rill Data, discusses the evolution of Druid, architectural decisions, user and developer experiences, BI tools, data architecture, and AI's impact on BI. He also shares humorous dreams outside of data.
