

The Data Stack Show
RudderStack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
Episodes

Mar 9, 2022 • 1h 6min
78: The Etymology of Reverse ETL & Why It’s a Key Piece Of The Modern Data Stack with Boris Jabes of Census
Highlights from this week’s conversation include:
Boris’ background and career journey (2:32)
The origins of “reverse ETL” (6:39)
Reverse Fivetran (16:35)
Product as an experience (22:41)
Fivetran users vs Census users (24:14)
How to add value to a data dump (26:56)
Ways companies are creating IP (33:48)
The cascade effect of the modern data stack (37:56)
Defining “data federation” (43:51)
Lessons from building a product (49:10)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack, visit rudderstack.com.

Mar 4, 2022 • 3min
The PRQL: Reverse ETL and the Distinction Between Operation vs Analysis on Data
Eric and Kostas preview their upcoming conversation with Boris Jabes of Census.

Mar 2, 2022 • 1h 1min
77: Standardizing Unstructured Data with Verl Allen of Claravine
Highlights from this week’s conversation include:
Verl’s career journey (2:46)
M&A data evaluation criteria (7:12)
What Claravine does (10:48)
The breadth of data (15:03)
Adding to content and advertising data (18:22)
How Claravine standardizes data (23:53)
Designing a data model (25:40)
The underlying technologies of building a product (33:43)
The main consumer (35:02)
Maintaining quality (39:06)
Helping solidify definitions (41:37)
Implementing Claravine’s model across various companies (44:54)
Internal changes’ effect on the model (46:47)
Connection brought about by structure (49:19)
Applying unstructured context to structured stamping (52:36)

Feb 25, 2022 • 6min
The PRQL: If Everything Is Data, How Can We Make Sense of It All?
Eric and Kostas preview their upcoming conversation with Verl Allen of Claravine.

Feb 23, 2022 • 52min
76: Why a Data Team Should Limit Its Own Superpowers with Sean Halliburton of CNN
Highlights from this week’s conversation include:
Sean’s career journey (3:27)
Optimization and localized testing results (7:49)
Denying potential access to more data (13:46)
Other dimensions data has (18:32)
The other side of capturing events (20:55)
Data equivalent of API contracts (25:03)
SDK restrictiveness for developers (27:40)
How to know if you’re still sending the right data (30:38)
Debugging that starts in a client of a mobile app (36:08)
Communicating about data (38:36)
The next phase of tooling (41:49)
Advice for aspiring managers (45:21)

Feb 18, 2022 • 4min
The PRQL: How Important Is the Human Factor When Working With Data?
Eric and Kostas preview their upcoming show with Sean Halliburton of WarnerMedia.

Feb 16, 2022 • 59min
75: How To Become a Data Engineer with Parham Parvizi of the Data Stack Academy
Highlights from this week’s conversation include:
Par’s background and current role (2:48)
About Talend (6:46)
Nonlinear pathways to data engineering roles (11:08)
What a data engineer needs to be successful (17:37)
Before “data engineer” was a title (27:59)
Signs you should be a data engineer (32:39)
Curiosity and data engineering (38:31)
Defining the modern data stack (45:07)
How to get a feel for data engineering (52:52)

Feb 11, 2022 • 4min
The PRQL: Can We Define the Role of the Data Engineer (Yet)?
In this PRQL, Eric and Kostas preview their upcoming conversation with Parham Parvizi of tura.io.

Feb 9, 2022 • 45min
74: Kostas Respawns at Starburst, is Interviewed by Eric, and Reminisces About Winamp
Highlights from this week’s conversation include:
Big News: podcast hits, Kostas’ career change (2:19)
Kostas’ career start in data pipelines (4:09)
The Winamp and Napster era (11:46)
Starting an API gateway (16:56)
Observing new technology from afar (23:43)
Starting Blendo (32:38)
Problems faced in architecting the product (37:12)
Kostas’ role at Starburst (40:25)

Feb 4, 2022 • 3min
The PRQL: What Prompts a Conversation About Winamp & Quake Arena on The Data Stack Show?
Eric and Kostas preview some exciting news coming up on episode 74 of the Data Stack Show.