The Data Stack Show

Latest episodes

Nov 20, 2023 • 2min

The PRQL: Building a Data Product for Data People: Looker's Vision and Omni's Future with Colin Zima

In this bonus episode, Eric and Kostas preview their upcoming conversation with Colin Zima of Omni.

Nov 15, 2023 • 57min

164: How The GTM and Data Teams at Snowflake Work Together with Travis Henry and Hillary Carpio

Highlights from this week’s conversation include:
The Unique Perspective of Practitioners (2:10)
Account-based Marketing (6:30)
Sales Development Representatives (SDR) (8:05)
Descriptive, People, and Engagement Data (11:38)
Data Overload and Actionable Data (14:20)
Working with Data Teams and Internal Data (17:52)
The relationship between business and data teams (22:27)
The importance of collaboration between marketing and data teams (24:17)
Travis and Hillary writing a book (25:33)
The taxonomy of personas (34:23)
Bucketing and grouping people in data systems (35:37)
Account-based marketing and sales alignment (39:00)
The data-driven approach and reliance on technology (44:25)
Managing complexity in data and account-based marketing (45:35)
Adapting to change and evolving data artifacts (51:58)
The importance of understanding the business (54:58)
Collaboration between data and go-to-market teams (55:56)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Nov 13, 2023 • 5min

The PRQL: Navigating the World of Data Overload with Travis Henry and Hillary Carpio of Snowflake

In this bonus episode, Eric and Kostas preview their upcoming conversation with Travis Henry and Hillary Carpio of Snowflake.

Nov 8, 2023 • 1h 4min

163: Simplifying Real-Time Streaming with David Yaffe and Johnny Graettinger of Estuary

Highlights from this week’s conversation include:
Johnny and David’s background in working together (1:56)
The background story of Estuary (4:15)
The challenges of ad tech and the need for low latency (5:44)
Use cases for moving data at scale (10:35)
Real-time data replication methods (11:54)
Challenges with Kafka and the birth of Gazette (13:54)
Comparing Kafka and Gazette (20:22)
The importance of existing streaming tools (22:28)
Challenges of managing Kafka and the need for a different approach (23:40)
The role of compaction in streaming applications (26:54)
The challenge of relaxing state management (34:01)
Replication and the problem of data synchronization (36:48)
Incremental Back Fills and Risk-Free Production Database (46:03)
Estuary as a Platform and Connectors (47:45)
The challenges of real-time streaming (57:56)
Orchestration in real-time streaming (1:00:51)

Nov 6, 2023 • 4min

The PRQL: The Shortcomings of Apache Kafka with David Yaffe and Johnny Graettinger of Estuary

In this bonus episode, Eric and Kostas preview their upcoming conversation with David Yaffe and Johnny Graettinger of Estuary.

Nov 1, 2023 • 57min

162: Accelerating Enterprise AI Transformation With Open Source LLMs Featuring Mark Huang of Gradient

Highlights from this week’s conversation include:
The potential of AI-driven applications (1:34)
The need for hardware infrastructure in AI experimentation (2:40)
Oligopoly on the closed side (11:50)
Advantages of private side vs. open source (13:18)
Leveraging valuable data within enterprises (16:00)
The urgency of adopting LLMs in the enterprise (24:02)
Expansion of LLMs into new business verticals (25:06)
The challenges of operationalizing LLMs (29:32)
Seamless experience with OpenAI (37:29)
Operationalizing with Gradient (38:36)
The early genesis of Gradient (48:53)
The democratization of AI through endpoints (51:44)
What is the future of language models? (54:07)

Oct 30, 2023 • 4min

The PRQL: How LLMs are Transforming Enterprise Workflows with Mark Huang of Gradient

In this bonus episode, Eric and Kostas preview their upcoming conversation with Mark Huang of Gradient.

Oct 25, 2023 • 1h 21min

161: The Intersection of Generative AI and Data Infrastructure with Chang She of LanceDB

Highlights from this episode include the challenges of data collection, the impact of AI hype, LanceDB's file and table format, an introduction to vector databases, the importance of unstructured data, the potential of generative AI, and changing expectations of information systems.

Oct 23, 2023 • 5min

The PRQL: How Did Pandas Become a Data Science Powerhouse? Featuring Chang She of Eto Labs

In this bonus episode, Eric and Kostas preview their upcoming conversation with Chang She of Eto Labs.

Oct 18, 2023 • 1h 6min

160: Closing the Gap Between Dev Teams and Data Teams with Santona Tuli of Upsolver

Highlights from this week’s conversation include:
Santona’s journey from nuclear physics to data science (4:59)
The appeal of startups and wearing multiple hats (8:12)
The challenge of pseudoscience in the news (10:24)
Approaching data with creativity and rigor (13:22)
Challenges and differences in data workflows (14:39)
Schema Evolution and Quality Problems (27:01)
Real-time Data Monitoring and Anomaly Detection (30:34)
The importance of data as a business differentiator (35:48)
The SQL job creation process (46:25)
Different options for creating solver jobs (47:20)
Adding column-level expectations (50:17)
Discussing the differences of working with data as a scientist and in a startup (1:00:18)
Final thoughts and takeaways (1:04:01)
