

The Data Stack Show
RudderStack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

May 15, 2023 • 5min
The PRQL: Data Infrastructure Systems and the Rust / WebAssembly Combo with A.J. Hunyady of InfinyOn
In this bonus episode, Eric and Kostas preview their upcoming conversation with A.J. Hunyady, Founder and CEO of InfinyOn.

May 10, 2023 • 59min
137: Data Collection Secrets & The Search Data Problem with Josh Wills
Highlights from this week’s conversation include:
Josh’s background in data working at Google, Slack, and other companies (1:21)
The need and process for high quality data (4:33)
Digging into auction code (14:03)
Joining Slack and working in the early days of the company (18:00)
Not fighting the last war in data (25:42)
Building a product, while using the product (30:35)
Transitioning to the search team at Slack (36:50)
Usage patterns of search (41:21)
Josh’s work in helping build DuckDB (46:20)
Having the right toolset to increase precision and efficiency (52:42)
Final thoughts and takeaways (56:03)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

May 8, 2023 • 2min
The PRQL: Data Engineers in the Front End with Josh Wills
In this bonus episode, Eric previews his upcoming conversation with Josh Wills, an experienced data scientist who has worked with IBM, Google, Slack, DuckDB, and more.

May 3, 2023 • 1h
136: System Evolution from Hadoop to RocksDB with Dhruba Borthakur of Rockset
Highlights from this week’s conversation include:
Dhruba’s journey into the data space (2:02)
The impact of Hadoop on the industry (3:37)
Dhruba’s work in the early days of the Facebook team (7:54)
Building and implementing RocksDB (14:33)
Stories with Mark Zuckerberg at Facebook (24:25)
The next evolution in storage hardware (26:14)
How Rockset is different from other real-time platforms (33:13)
Going from a key value store to an index (37:15)
Where does Rockset go from here? (44:59)
The success of RocksDB as an open source project (49:11)
How do we properly steward real-time technology for impact (51:17)
Final thoughts and takeaways (56:18)

May 1, 2023 • 3min
The PRQL: Hardware Innovation Begets Software Innovation with Dhruba Borthakur Co-Founder and CTO, Rockset
In this bonus episode, Eric and Kostas preview their upcoming conversation with Dhruba Borthakur of Rockset.

Apr 28, 2023 • 15min
Data Council Week (Ep 7) - What’s Next for Data Council? With Pete Soderling of Data Council
Highlights from this week’s conversation include:
The origin story of Data Council (0:39)
Developments for the future of Data Council (2:42)
The emphasis on AI and ChatGPT at this year’s conference (3:54)
The support of the data community (5:31)
Biggest changes and innovations in the industry (7:10)
What’s next for the Data Council? (10:46)
Getting connected with Data Council (13:07)

Apr 27, 2023 • 40min
Data Council Week (Ep 6) - All About Debezium and Change Data Capture With Gunnar Morling of Decodable
Gunnar Morling discusses how Debezium replicates data, working with Kafka, the importance of documentation in open-source projects, and the vision moving forward. The conversation also covers the challenges of open-source CDC solutions and the importance of building a diverse system with common interfaces.

Apr 26, 2023 • 43min
Data Council Week (Ep 5) - The Difference Between Data Platforms and ML Platforms with Michael Del Balso of Tecton
Highlights from this week’s conversation include:
Michael’s journey to co-founding Tecton (0:22)
The evolution of MLOps and platform teams (3:50)
Understanding boundaries between the data platform and MLOps (8:42)
Differences in machine learning vs data pipelines (16:58)
The systems needed to handle all these types of data (22:22)
Developer experience in Tecton (25:15)
Automating challenges in ML development (32:30)
The most difficult part of the life cycle of prediction (37:24)
Exciting new developments at Tecton (39:27)

Apr 26, 2023 • 46min
Data Council Week (Ep 4) - Using Data Anonymization for Identity Protection With Will Thompson of Privacy Dynamics
Highlights from this week’s conversation include:
Will’s background in data (0:28)
Privacy dynamics and data anonymization (4:18)
Addressing data privacy problems in the space (10:33)
Developer experience with Privacy Dynamics (13:49)
How does Privacy Dynamics work? (21:09)
Update of real-time anonymized data (26:29)
The problem of dates and other complexities in data (31:24)
Being a data engineer in a startup (34:44)
Moving at the speed of a startup (41:01)
Connecting with Will and Privacy Dynamics (43:28)

Apr 25, 2023 • 1h 2min
Data Council Week (Ep 3) - GTM 101 for Engineers With Chase Roberts of Vertex Ventures
Highlights from this week’s conversation include:
Chase’s journey to where he is today (0:51)
Lessons from go-to-market roles that help in the VC world (2:38)
Differentiating between go-to-market and distribution (8:13)
Taking an idea to the market (11:33)
Hardest part of the pitch (17:08)
Playbooks for go-to-market founders to follow (20:25)
Focus of sales and marketing in go-to-market strategy (28:01)
Answering the what and how of the problem you are solving (32:30)
The importance of pricing in a go-to-market strategy (46:11)
Connecting with Chase (1:00:58)