The Data Stack Show

RudderStack
Oct 20, 2021 • 50min

58: Data Federation is No Longer The "F" Word with Scott Gnau of InterSystems

Highlights from this week’s conversation include:
Solving problems with data has been a long-time passion of Scott’s (2:52)
Day-to-day use of data at InterSystems (6:25)
The technical aspects involved in constructing a data fabric (17:52)
Companies at a variety of maturity levels can adopt a data fabric (26:49)
A paradigm shift in the marketplace (28:39)
Comparing and contrasting data fabric and data mesh (30:49)
Sharing data across the business and not having it siloed in different departments (39:46)
Privacy and security within a data fabric (41:22)
The future of data fabric and pushing the edge (43:17)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
Oct 15, 2021 • 7min

Data Debrief: Can Tools Help Solve Data Quality Organizational Challenges?

On this Data Debrief, Eric and Kostas are joined by Brian from RudderStack to talk about data quality.
Oct 13, 2021 • 56min

57: Improving Data Quality Using Data Product SLAs with Egor Gryaznov of Bigeye

Highlights from this week’s conversation include:
Egor’s software engineering background and history with Uber (2:19)
Experimentation platforms and analytics definitions (7:49)
Bigeye’s function and use cases (9:40)
Managing the relationship between the data engineer maintaining the pipelines and the downstream teams providing the context (18:49)
Pinpointing problems in data compared to problems in software (21:55)
Defining data quality at Bigeye (24:13)
Machine learning models as a data product (28:38)
Determining SLAs (32:22)
How Bigeye brings different parties together and addresses natural communication barriers (36:42)
Looking at when an organization needs to implement data quality tooling (45:54)
Oct 6, 2021 • 1h 4min

56: Stream Processing and Observability with Jeff Chao of Stripe

Highlights from this week’s conversation include:
Jeff’s history with stream processing (2:52)
Working with Mantis to address the impact of Netflix downtime (4:20)
Defining observability as operational insight (6:58)
Time series data and the value of data today (18:52)
Data integration’s shift from batch to streaming (29:34)
The current state of change data capture (32:20)
How an engineer thinks of the end-user (56:21)
Sep 29, 2021 • 1h 7min

55: Tables vs. Streams and Defining Real-Time with Pete Goddard of Deephaven Data Labs

Highlights from this week’s conversation include:
Pete’s background in data engineering and capital market trading (2:10)
Comparison of the tooling from 2012 when Deephaven started with that of today (10:30)
Taking a closer look at defining real-time data (19:47)
Getting non-technical people, clients, and developers all on the same platform (36:11)
Deephaven’s incremental update model (40:25)
Kafka, timely data flow, and Deephaven (44:22)
Use cases for Deephaven (51:52)
Going to GitHub to try out Deephaven (1:02:43)
Sep 22, 2021 • 1h 9min

54: The Center of the Modern Data Stack with Neil Rahilly of Mixpanel

Highlights from this week’s conversation include:
Neil’s programming hobby turned into a career and how he cold-contacted Mixpanel for a job (2:28)
Lessons learned from nine years at Mixpanel (5:05)
Defining product analytics (8:06)
How Mixpanel has evolved into the product it is today (10:56)
The importance of Mixpanel’s real-time analysis (19:52)
Looking at Arb, Mixpanel’s own arbitrary segmentation database (23:44)
The business impact that the rise of the cloud data warehouse had on Mixpanel (34:56)
Sub-second latencies and real-time use cases (49:05)
Career advice from Neil (1:02:02)
Sep 15, 2021 • 1h 20min

53: What Religion, a Cult, and a Tech Product Have in Common, with Bart Farrell of DoKC

Highlights from this week’s conversation include:
Bart’s journey from southern California, to New York, to Egypt, to London, to Spain (3:31)
Exposure to different communities and finding shared language and experience (10:21)
Looking back at early online communities and how they furthered your learning journey (27:50)
How the level of niche-ness impacts a community (44:06)
The cautionary tale of WeWork (57:28)
Surefire community killers (1:03:44)
Open source communities in tech and the passion that drives them (1:08:11)

Follow the Data on Kubernetes Community at DoK.community and on Twitter at @DoKCommunity. You can follow Bart at @birthmarkbart.
Sep 8, 2021 • 1h 9min

52: Discussing Data Warehouses, Lakes, and Meshes with James Serra of EY

Highlights from this week’s conversation include:
James’ background at Microsoft and current work with EY’s data fabric (2:22)
The external and internal facing components of EY’s data fabric (6:39)
The importance of the data lineage (11:29)
The most important requirements for data quality (15:32)
Looking at the data capabilities of Microsoft (21:30)
The data warehouse, explained (29:00)
Using a data warehouse or a data lake (34:33)
Defining the buzzword data mesh (51:13)
The problem with data mesh (59:31)
Sep 1, 2021 • 55min

51: Democratizing AI and ML with Tristan Zajonc of Continual

Topics in this wide-ranging conversation include:
Tristan’s background with Cloudera and the need for continual operational ML and AI (3:15)
How the complexity of Continual is hidden behind a simplicity of use (14:48)
Focusing on data that lives within a data warehouse (18:43)
Understanding features in the ML conversation (22:47)
The three layers of Continual (26:11)
The importance of SQL to Continual (30:19)
Caching layers and the data warehouse centric approach (38:28)
Betting on the warehouse being a central component of data stack architecture (43:34)
Aug 25, 2021 • 59min

50: From Data Infrastructure to Data Management with Ananth Packkildurai

Highlights from this week’s episode:
Ananth’s background (2:51)
The evolution of Slack (4:54)
Kafka and Presto: two of the most reliable and flexible tools for Ananth (9:43)
How Snowflake gained an advantage over Presto (13:24)
Opinions about data lakes (17:23)
Core features of data infrastructure (23:22)
The tools define the process, and not the other way around (31:30)
Defining a data mesh (36:44)
Data is inherently social in nature (40:31)
Lessons learned from writing Data Engineering Weekly (49:14)
