The Data Stack Show

RudderStack
Jun 21, 2023 • 1h 15min

143: Collaborative Data Analytics on the Data Warehouse, featuring Rob Woollen & Stipo Josipovic of Sigma

Highlights from this week’s conversation include:
Stipo and Rob’s background in data (2:43)
What is Sigma? (7:46)
Takeaways from building analytics products in-house (9:16)
Sigma’s approach to datastore interface (11:32)
Why analytics and BI are still not a solved problem (15:50)
Combining SQL and spreadsheets for useful interface (23:17)
The evolution of BI to today (29:40)
Overcoming the challenges of collaboration in working with data (33:17)
Creating operational coding that humans can understand (46:50)
The future of BI (54:00)
Cloud’s impact on BI and analytics (1:00:04)
The value of getting close to the data for analytics (1:02:21)
Final thoughts and takeaways (1:08:45)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Jun 19, 2023 • 6min

The PRQL: Modern Analytics Using Common Paradigms, Featuring Rob Woollen & Stipo Josipovic of Sigma

In this bonus episode, Eric and Kostas preview their upcoming conversation with Rob Woollen & Stipo Josipovic of Sigma.

Jun 16, 2023 • 24min

Shop Talk: Why AI Is Not Another Crypto

Jun 14, 2023 • 57min

142: Martech’s Separation and Return to Data Infrastructure with Scott Brinker of HubSpot

Highlights from this week’s conversation include:
Scott’s background in martech (3:10)
Where things have gone wrong between IT and marketing (5:46)
The explosion of digital marketing data (12:04)
Costs of having data siloed (16:14)
The convergence of marketing and IT teams around data (19:27)
Navigating the massive landscape of martech tools (26:10)
Needed tools in the martech stack (31:11)
The importance of an accurate attribution model (34:37)
Building tooling for marketers and developers to use (39:20)
Future areas of development in the martech space (44:46)
Final thoughts and takeaways (52:40)

Jun 12, 2023 • 4min

The PRQL: Marketing, Martech, and Data with Scott Brinker of HubSpot

In this bonus episode, Eric and Kostas preview their upcoming conversation with Scott Brinker of HubSpot.

Jun 7, 2023 • 58min

141: A Journey From Backend Engineer to Data Engineer with Ioannis Foukarakis of Mattermost

Highlights from this week’s conversation include:
Ioannis’ background and journey in data (2:42)
RudderStack’s transformations feature and examples of its application (4:20)
Winning the transformations contest at RudderStack (7:21)
How Ioannis’ transformation project works for data governance (9:40)
Memories from college for Ioannis and Kostas (12:30)
Getting into the world of software development (17:27)
The changes in data and engineering over the years (20:29)
Bridging Java with Python (23:15)
Dealing with ML workloads in the past vs. workflows of today (26:30)
Data engineers and ML engineers (33:12)
Dealing with data in the early stages to ensure reliability later on (38:39)
What creates problems with data quality? (42:11)
Exciting developments in data engineering (46:48)
Final thoughts and takeaways (51:12)

Jun 5, 2023 • 6min

The PRQL: The Portability of Engineering Fundamentals with Ioannis Foukarakis of Mattermost

In this bonus episode, Eric and Kostas preview their upcoming conversation with Ioannis Foukarakis of Mattermost.

May 31, 2023 • 1h 2min

140: Stream Processing for Machine Learning with Davor Bonaci of DataStax

Highlights from this week’s conversation include:
Davor’s journey from Google and what he was building there (3:32)
How work in stream processing changed Davor’s journey (5:10)
Analytical predictive models and infrastructure (9:39)
How Kaskada serves as a recommendation engine with data (14:05)
Kaskada’s user experience as an event processing platform (20:06)
Enhancing typical feature store architecture to achieve better results (23:34)
What is needed to improve stream and batch processes (27:39)
Using another syntax instead of SQL (36:44)
DataStax acquiring Kaskada and what will come from that merger (40:24)
Operationalizing and democratizing ML (47:54)
Final thoughts and takeaways (56:04)

May 29, 2023 • 5min

The PRQL: Kaskada Serving as a Recommendation Engine with Davor Bonaci of DataStax

In this bonus episode, Eric and Kostas preview their upcoming conversation with Davor Bonaci of DataStax.

May 24, 2023 • 58min

139: Decoupling the Execution Engine From Python’s Pandas with Aditya Parameswaran of Ponder

Highlights from this week’s conversation include:
Aditya’s background and journey in the data space (2:47)
What does Ponder do? (5:18)
101 on Pandas and why people utilize it (6:42)
The challenge of translating Pandas to a big data platform (16:11)
Data Warehouses and ML workflows (21:27)
The differences in the “zoo” of data languages (26:56)
Why do ML and data engineering have to be so different in languages? (34:39)
Builders should be adapting to the users and not the other way around (39:32)
Will we see a singular data interface in the future? (46:19)
Aditya’s most surprising discovery in his research (50:40)
Final thoughts and takeaways (53:18)

Read more of Aditya's work:
Pandas vs. SQL – Part 1: The Food Court and the Michelin-Style Restaurant
Pandas vs. SQL – Part 2: Pandas Is More Concise
Pandas vs. SQL – Part 3: Pandas Is More Flexible
Pandas vs. SQL – Part 4: Pandas Is More Convenient
