

The Data Stack Show
Rudderstack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
Episodes
Mentioned books

Jan 6, 2021 • 53min
19: Defining Data Governance with Stephen Bailey from Immuta
This week on The Data Stack Show, Kostas and Eric are joined by Stephen Bailey, Director of Applied Data Science at Immuta. Immuta is a startup that focuses on enabling data teams to have really fast, efficient, and understandable access controls on their data. Highlights from this week’s episode include:The problem that Immuta solves (2:04)Stephen’s background researching how the brain works (4:56)Immuta’s stack (15:09)Leveraging metadata (18:02)The main use case for Immuta is simplifying the access control layer (20:06)Unifying data (31:52)Defining the quality of data (34:04)Learning to trust the numbers (39:42)What’s next for Immuta (46:15)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Dec 31, 2020 • 55min
18: Data Science in Health Insurance with Jason Haupt of Bind
This week on The Data Stack Show, Kostas and Eric are joined by Jason Haupt, data science lead at Bind, a no-deductible health insurance company determined to give immediate answers and clear costs before point of care. Jason’s unique background of having a Ph.D. in particle physics and working at the Large Hadron Collider at CERN have informed the way he goes about approaching data at Bind.Highlights from this week’s episode include:Jason’s background in particle physics and his path to Bind (2:53)A cloud-only approach to data and utilizing AWS (9:01)Focusing on activities that help its members (12:08)Dealing with 12,000 columns of data from an insurance claim form (17:13)Rethinking the relationship between marketing and product teams (25:28)Examining the data pipeline (29:30)Privacy and security concerns with medical information (35:45)How experience with the LHC impacted the way he thinks about data (40:06)Transition from academic work to industry (46:20)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Dec 9, 2020 • 57min
17: Working with Data at Netflix with Ioannis Papapanagiotou
This week on The Data Stack Show, Kostas and Eric are joined by Ioannis Papapanagiotou, senior engineering manager at Netflix. Ioannis oversees Netflix’s data storage platform and its data integration platform. Their conversation highlighted the various responsibilities his lean teams have, utilizing open source technology and incorporating change data capture solutions.Key points in this week’s episode include:Ioannis’ background with academia and Netflix (4:42)Comparing the data storage and data integration teams (6:19)Discussing indexing and encryption (20:31)Netflix’s role in the open source community (27:21)Implementing change data capture (40:42)Using Bulldozer to efficiently move data in batches from data warehouse tables to key-value stores (42:43)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Dec 3, 2020 • 46min
16: Applying the Event Sourcing Pattern at Scale with Andrew Elster from Earnnest
On this week’s episode of The Data Stack Show, Kostas and Eric finish part two of a conversation about Earnnest, a digital platform originally designed for facilitating real estate transactions. In the previous episode, they talked with the CTO and co-founder Daniel Jeffords, and in this week’s episode, they talked with the other co-founder, Andrew Elster, CIO and chief architect. Andrew describes more about Earnnest’s stack and their decision to utilize Elixir and talks about their vision for scaling up their product.Key topics in the conversation include:Andrew’s journey from electrical engineering, to avoiding pirates in oceanic oil exploration, to starting Earnnest (2:57)Keeping the platform flexible to expand beyond real estate transactions (10:24)Being adaptable to support existing workflows (18:33)The evolution of the database and implementing event sourcing (25:01)Using a functional language like Elixir (30:54)Developing Earnnest with scale in mind (37:33)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Nov 19, 2020 • 48min
15: Early Stage Analytics and Learning from the Y Combinator Experience with Axel Delafosse from Pool
This week on The Data Stack Show, Kostas and Eric are joined by Axel Delafosse, founder and CEO of Pool, a messaging app designed to help couples spend less time deciding what to do and spend more time together. Axel shares his story of how he went from having his idea being shot down in person by Paul Graham to being accepted for Y Combinator. While Pool is still a young startup, Axel offers wise insight from lessons he’s learned along the way.Highlights from this week’s episode include:Pool Messenger, “the ultimate antidote to decision paralysis” (2:50)Pitching to Paul Graham and applying to YC (6:17)The importance of the co-founder relationship (14:01)The YC experience and losing Facebook’s API (17:37)Products die, relationships last (22:05)Breaking down the data stack (28:50)Using data and conversations with users to evaluate the experience (36:12)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Nov 11, 2020 • 48min
14: Breaking Down Electronic Money Transfers and Modernizing Real Estate Transactions with Dan Jeffords of Earnnest
This week on The Data Stack Show, Kostas and Eric chat with Daniel Jeffords, CTO and co-founder of Earnnest, a financial tool for the real estate industry. Earnnest’s digital platform allows buyers to securely and electronically deposit funds directly to an escrow holder and keeps agents, buyers, and escrow holders in the loop with automated emails and tracking information.Highlights from this week’s episode include:Earnnest’s approach to the way payments are handled in an antiquated real estate industry (2:12)Clearing up the differences in the way money changes hands, ACH, wire, and checks (12:39)How Earnnest works and who are the involved parties (21:06)Disrupting a highly regulated industry (24:24)Emphasizing security and transparency (30:09)Erlang, Elixir, Dwolla and more. How Earnnest uses data (33:40)Trying very hard to store very little data (42:58)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Nov 6, 2020 • 40min
13: Building Open Source Products at Scale with Reza Shafii from Kong Inc.
This week on The Data Stack Show, Reza Shafii, vice president of products at Kong Inc. discusses open source projects and products with Kostas and Eric. Kong is a cloud connectivity company best known for being the creator and primary supporter of Kong, the most widely adopted open-source micro service API gateway.Highlights from this week’s episode include:Being a self-proclaimed middleware geek (2:17)Middleware explained (5:41)Kong as a company, open source project, and a brand (10:44)Drawing the lines between the open source and property parts of a SaaS platform (24:22)Dealing with the extra friction in adopting middleware from the bottom up (33:02)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Oct 28, 2020 • 56min
12: Building a CDP on your Data Warehouse with Nicholas Ziech-Lopez of MessageGears
In this episode of The Data Stack Show, hosts Kostas Pardalis and Eric Dodds talk with Nicholas Ziech-Lopez, director of product strategy at MessageGears. MessageGears is designed to reduce data friction for marketers by connecting directly to a brand’s data source and using their live data. This episode centered around the world of CDPs and where MessageGears fits in that space.Highlights from this week’s episode include:Nicholas’ arrival at MessageGears and the company’s background (2:20)MessageGears data sources (6:52)Accessing the data warehouses (9:19)Coordination and crossover of data and marketing roles (20:57)Being a customer marketing platform (31:43)Dealing with messy data (36:04)Bridging the physical and digital world with consumers (43:49)What’s coming up next for MessageGears (51:09)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Oct 21, 2020 • 59min
11: Why Modern Cyber Security is a Data Problem with Jack Naglieri of Panther Labs
This week’s episode of The Data Stack Show features a conversation with hosts Kostas Pardalis and Eric Dodds and guest Jack Naglieri, founder and CEO of Panther Labs. Panther, a San Francisco based startup, is an open platform that helps security teams detect and respond to breaches in cloud-native environments, providing a modern alternative to traditional SIEMs.Highlights from this week’s episode include:Introduction to Jack and Panther Labs (2:33)The different pillars of data security (10:24)Onboarding process for a company using Panther (18:40)Thinking of security as a data problem (24:55)Using S3 and other infrastructure suggestions that will be helpful in the long run (32:16)Use cases for analyzing past and real-time data (39:20)Panther’s data stack (42:54)Open source technology being helpful for the community (47:57)The future for Panther (54:39)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Oct 14, 2020 • 56min
10: The Evolution of the BI Market with Huy Nguyen of Holistics
In this week’s episode of The Data Stack Show, Kostas Pardalis and Eric Dodds are joined by CTO and Co-Founder of Holistics, Huy Nguyen. Holistics takes an approach to business intelligence and data analytics that they call DataOps. They focus on data team productivity and company-wide access to insights. Important points in the conversation included:Introduction to Huy and Holistics (3:12)Approaching BI with more than just visualization (8:59)How friction between different roles within an organization is addressed by Holistics (15:20)Holistics as a complementary tool (23:25)Describing their own data stack (34:47)History of BI and trends for the future (39:33)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.