

The Data Stack Show
Rudderstack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
Episodes
Mentioned books

Mar 20, 2024 • 1h 1min
182: Building a Dynamic Data Infrastructure at Enterprise Scale Featuring Kevin Liu of Stripe
Kevin Liu from Stripe discusses evolving data infrastructure, speech recognition work at Amazon, metadata analysis surprises, product sizing, data pipelining, and the future of open source projects in data infrastructure.

Mar 18, 2024 • 6min
The PRQL: Exploring the Intersection of Software Engineering and Data Management with Kevin Liu of Stripe
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Mar 13, 2024 • 60min
181: OLAP Engines and the Next Generation of Business Intelligence with Mike Driscoll of Rill Data
Mike Driscoll, Co-founder of Rill Data, discusses the evolution of Druid, architectural decisions, user and developer experiences, BI tools, data architecture, AI's impact on BI. He also shares humorous dreams outside of data.

Mar 11, 2024 • 6min
The PRQL: Making the Data Stack Serverless in the Cloud with Mike Driscoll of Rill Data
Mike Driscoll shares his journey from banks to MetaMarkets to Rill Data, discussing the rise of technologies like Druids and trends in data engines. They explore the transition to serverless frameworks in the cloud for operational business intelligence.

Mar 6, 2024 • 53min
180: Data Observability and AI for Data Operations Featuring Kunal Agarwal of Unravel Data
Kunal Agarwal, CEO of Unravel Data, discusses data operations evolution, Unravel's role, challenges at scale, ROI on data products, cost management, observability in AI, and AI adoption challenges. He shares his journey from fashion to enterprise data management, emphasizing efficiency in cost management and measuring productivity and reliability. The podcast explores diverse technologies in data operations and final takeaways on driving better outcomes with data.

Mar 4, 2024 • 5min
The PRQL: What’s Driving The Evolution of Data Operations? Featuring Kunal Agarwal of Unravel Data
In this bonus episode, Eric and Kostas preview their upcoming conversation with Kunal Agarwal of Unravel Data. Hosted by Simplecast, an AdsWizz company. See https://pcm.adswizz.com
for information about our collection and use of personal data for
advertising.

Feb 28, 2024 • 51min
179: Time Series Data Management and Data Modeling with Tony Wang of Stanford University
Stanford University PhD student, Tony Wang, discusses his research focus on time series data management. Topics include challenges in academia and industry, academic lab structure, decision to move from hardware to data research, data modeling in time series, issues and potential solutions for parquet format, and the role of external indices in parquet files.

Feb 26, 2024 • 3min
The PRQL: How is Academic Research Shaping the Future of Data Processing Systems? Featuring Tony Wang of Stanford University
Tony Wang, an academic researcher at Stanford University, discusses his research in data systems and databases, the connection between academia and industry, and shares insights on data processing systems and future trends with the hosts.

Feb 21, 2024 • 57min
178: How to Build a Data Stack to Win PLG, Featuring Peter Chapman
Highlights from this week’s conversation include:Peter's background and journey in data (0:26)Introduction to PLG (4:18)Starting in data at Heroku (6:05)Building the data stack at Heroku (8:13)Data stack requirements for early-stage companies (12:00)Differentiating PLG companies from open source companies (19:26)Venture capital and open source as a lever for growth (22:56)Initial data modeling and analysis (25:38)Operationalizing Data (29:16)Sales and Marketing Operationalization (31:52)Identifying Signals (34:16)Challenges in Developing Signals (37:07)Account Management for Developer Tools (42:30)Challenges in Achieving Margins (45:02)Leveraging Infrastructure for Margins (47:35)Inference vs Training (54:55)Final thoughts and takeaways (57:02)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Feb 19, 2024 • 6min
The PRQL: Building a Future-Proof Data Stack from Day Zero? Featuring Peter Chapman
GTM consultant Peter Chapman discusses the importance of data in business operations, focusing on PLG and financial implications of data-driven tools. The episode also includes a preview of Peter's data journey at Roku and personal connection with the hosts in San Francisco and Silicon Valley.