The Analytics Engineering Podcast cover image

The Analytics Engineering Podcast

Latest episodes

undefined
6 snips
Nov 3, 2023 • 46min

Navigating AI Complexity (w/ Jonathan Frankle)

Jonathan Frankle, Chief Scientist at MosaicML, discusses the future of training specialized models, MosaicML inside Databricks, and responsible AI practices. They explore LLM-based systems, one-way hash functions, and the integration of Databricks platform. The trade-off between model size, cost, and complexity in AI training is also touched upon.
undefined
8 snips
Oct 20, 2023 • 29min

Career Growth in Data Roles (w/ Hubspot's Kasey Mazza at Coalesce 2023)

Kasey Mazza, analytics engineering manager at HubSpot, discusses the roles of data analysts and analytics engineers, building internal data communities, and the evolving landscape of data teams. They also explore career growth and satisfaction, the interplay between organizational structures and tooling, challenges in decision-making and central governance, using DBT across the company, and the future of the data industry.
undefined
Oct 6, 2023 • 42min

Operationalizing Your Warehouse, Streaming Analytics, and Cereal (W/ Arjun Narayan of Materialize and Nathan Bean of General Mills)

Arjun Narayan, CEO of Materialize, and Nathan Bean, a data leader at General Mills, discuss operationalizing warehouses, streaming analytics, and the challenges of manufacturing cereal. They cover the maturation of streaming technology, data management challenges, real-time operational decision-making, managing variation in manufacturing, digital twins for manufacturing line automation, operationalizing warehouses, trade-offs between batch and real-time analytics, the evolution of streaming analytics, and query languages in data analytics.
undefined
Sep 22, 2023 • 40min

Roche’s Data Transformation Journey (w/ Yannick Misteli)

Yannick Misteli is the head of engineering for the go-to-market domain at Roche, a $250 billion multinational pharmaceutical and diagnostics company.  Roche was an early supporter of dbt Cloud, and Yannick helped move his team of 120+ engineers to a modern data stack. He always finds a way to push the boundaries to make a large company founded in 1896 incredibly modern and innovative. We wanted to know more about the "how" of the work—the people, process, and technology.  Read more about Roche's data journey here: https://docs.getdbt.com/blog/dbt-squared
undefined
7 snips
Sep 8, 2023 • 48min

The State of Databases Today (w/ Andy Pavlo)

Andy Pavlo, a professor of databaseology at Carnegie Mellon and founder of OtterTune, talks about the complexity and specialization of database systems, the trend of separating storage and compute, scaling challenges, and the future of the data industry.
undefined
Aug 25, 2023 • 43min

Bring Your Own Data to LLMs (W/ Jerry Liu of LlamaIndex)

Jerry Liu, CEO and co-founder of LlamaIndex, discusses how companies are bringing their data to tailor large language models (LLMs) for their needs. Topics include working on LLMs versus autonomous systems, skill set and data preparation for LOMs, using databases for storing embeddings, capabilities of LMs in analyzing user questions, and exploring agents and specialized microservices in analytics engineering.
undefined
16 snips
Aug 11, 2023 • 49min

Ramp's $8 Billion Data Strategy (W/ Ian Macomber and Ryan Delgado)

Ian Macomber, head of analytics engineering and data science at Ramp and formerly the VP of analytics and data engineering at Drizly, and Ryan Delgado, a staff software engineer at Ramp, have played pivotal roles in establishing Ramp's data team from the ground up and are spearheading the development of their comprehensive roadmap. In this conversation with Tristan and Julia, Ian and Ryan share insights on how Ramp's data team transformed unstructured data from contracts into valuable insights to enable faster decision-making. The $8 billion company values speed and empowers teams to build, ship, and measure products quickly. Ian and Ryan also talked about their approach to adopting new tech and elevating data as an equal player alongside product engineering and design. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.
undefined
Jul 28, 2023 • 31min

dbt Labs on dbt (w/ Daniel Le)

Daniel Le is the CFO at dbt Labs where he has built multiple teams. He is also the former head of FP&A and operations at Zoom, and he helped scale FP&A as the former finance director at Okta.  In this conversation with Julia, Daniel shares his view as CFO on the challenges SaaS companies face and the importance of finance teams creating a holistic view of their business. Daniel gives advice to data leaders about how they can automate business processes with dbt Cloud and use self-service analytics to automate revenue recognition, generate consistent headcount analytics, and more to impact their organization. Read more about Daniel’s story here. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.
undefined
21 snips
Jul 12, 2023 • 48min

The Arc of Data Innovation (w/ Bob Muglia, former CEO of Snowflake)

Bob Muglia likely needs no introduction. The former CEO of Snowflake led the company during its early, transformational years after a long career at Microsoft and Juniper.  Bob recently released the book The Datapreneurs about the arc of innovation in the data industry, starting with the first relational databases all the way to the present craze of LLMs and beyond. In this conversation with Tristan and Julia, Bob shares insights into the future of data engineering and its potential business impact while offering a glimpse into his professional journey.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.
undefined
Apr 21, 2023 • 48min

It's 2023, and Privacy Is Now Fun! (w/ Ian Coe of Tonic.ai + Abhishek Bhowmick of Samooha)

Advances in ML have transformed data privacy from a regulatory necessity into an opportunity to improve the work of data people. Synthetic data for modeling + testing is one example of a hard thing that's now easy - and in this conversation with Tristan and Julia, Ian + Abhishek cover many other ways that privacy can actually be a skill that propels your work forward, rather than a mere legal best practice. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode