
The Analytics Engineering Podcast
Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet’s best data science & analytics articles.
Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering.
You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com.
The podcast is sponsored by dbt labs, makers of the data transformation framework dbt. To reach our team, drop a note to podcast@dbtlabs.com.
Latest episodes

Mar 24, 2024 • 48min
How the Media Covers Gen AI (w/ Matthew Lynley, Supervised)
Journalist/data practitioner Matthew Lynley discusses the rise of Gen AI in the media, his journey from mathematics to journalism, encountering Chat GPT, challenges of monetizing an AI newsletter, AI content creation, and the convergence of fields in AI development.

9 snips
Mar 10, 2024 • 48min
AI's Impact in the World of Structured Data Analytics (w/ Juan Sequeda, data.world)
Topics discussed include semantics, text-to-SQL performance, AI's impact on structured data analytics, evolution of semantic web, knowledge graphs, observability in business analysis, GPT-4 for text to SQL, design patterns in LLM interaction, and knowledge management in the data industry.

17 snips
Feb 25, 2024 • 46min
The End of the Modern Data Stack (w/ Benn Stancil, Mode)
Exploring the evolution of the modern data stack, AI integration in analytics, Mode's transformation to a cloud-first company, AI-powered formula builder for SQL queries, redefining the modern data stack, empowering business through IT and data teams, and envisioning the future of the data industry.

5 snips
Dec 8, 2023 • 46min
Data Mesh Architecture at Large Enterprises (w/ Moritz Heimpel and Ben Flusberg)
Moritz Heimpel from Siemens and Ben Flusberg from Cox Automotive discuss their experiences adopting a data mesh architecture and collaborating with data at scale in large organizations. They explore topics such as setting up infrastructure to scale, domain specific ownership of data, data security, and the future of the data industry.

6 snips
Nov 17, 2023 • 44min
Let's Talk About Data Vault (w/ Brandon Taylor and Michael Olschimke)
Brandon Taylor and Michael Olschimke discuss the Data Vault approach, its adoption in Europe, alignment with data mesh architecture, and the ongoing debate over Data Vault vs. Kimball methods. They also explore the impact of DataMesh on data warehouse design, recommended data modeling approaches, and hopes for the data industry.

6 snips
Nov 3, 2023 • 46min
Navigating AI Complexity (w/ Jonathan Frankle)
Jonathan Frankle, Chief Scientist at MosaicML, discusses the future of training specialized models, MosaicML inside Databricks, and responsible AI practices. They explore LLM-based systems, one-way hash functions, and the integration of Databricks platform. The trade-off between model size, cost, and complexity in AI training is also touched upon.

8 snips
Oct 20, 2023 • 29min
Career Growth in Data Roles (w/ Hubspot's Kasey Mazza at Coalesce 2023)
Kasey Mazza, analytics engineering manager at HubSpot, discusses the roles of data analysts and analytics engineers, building internal data communities, and the evolving landscape of data teams. They also explore career growth and satisfaction, the interplay between organizational structures and tooling, challenges in decision-making and central governance, using DBT across the company, and the future of the data industry.

Oct 6, 2023 • 42min
Operationalizing Your Warehouse, Streaming Analytics, and Cereal (W/ Arjun Narayan of Materialize and Nathan Bean of General Mills)
Arjun Narayan, CEO of Materialize, and Nathan Bean, a data leader at General Mills, discuss operationalizing warehouses, streaming analytics, and the challenges of manufacturing cereal. They cover the maturation of streaming technology, data management challenges, real-time operational decision-making, managing variation in manufacturing, digital twins for manufacturing line automation, operationalizing warehouses, trade-offs between batch and real-time analytics, the evolution of streaming analytics, and query languages in data analytics.

Sep 22, 2023 • 40min
Roche’s Data Transformation Journey (w/ Yannick Misteli)
Yannick Misteli is the head of engineering for the go-to-market domain at Roche, a $250 billion multinational pharmaceutical and diagnostics company. Roche was an early supporter of dbt Cloud, and Yannick helped move his team of 120+ engineers to a modern data stack. He always finds a way to push the boundaries to make a large company founded in 1896 incredibly modern and innovative. We wanted to know more about the "how" of the work—the people, process, and technology. Read more about Roche's data journey here: https://docs.getdbt.com/blog/dbt-squared

7 snips
Sep 8, 2023 • 48min
The State of Databases Today (w/ Andy Pavlo)
Andy Pavlo, a professor of databaseology at Carnegie Mellon and founder of OtterTune, talks about the complexity and specialization of database systems, the trend of separating storage and compute, scaling challenges, and the future of the data industry.