Data Engineering Podcast

From Data Engineering to AI Engineering: Where the Lines Blur

68 snips
Dec 14, 2025
Explore the evolution of data engineering as it merges with AI. Discover how the transition from Hadoop to cloud warehouses has shaped current practices. Uncover the impact of LLMs and how unstructured data is revolutionizing information retrieval. Delve into operational demands, including uptime and latency, in customer-facing applications. Reflect on the need for collaboration, new testing practices, and a community approach to emerging AI workflows. This journey emphasizes adapting skills to a rapidly changing technological landscape.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Origin Of Data Engineering

  • Data engineering arose to give data scientists reliable, cleaned data so models could be productive.
  • Tobias Macey explains the role grew from warehousing and BI needs as datasets and hiring surged.
INSIGHT

From Hadoop To Cloud Warehouses

  • The shift from Hadoop to cloud warehouses changed how engineers structure and serve data.
  • Tobias Macey highlights Snowflake, Redshift and columnar engines enabling repeatable, reliable pipelines.
INSIGHT

Blurring Between Data And AI

  • Generative AI and LLMs have blurred boundaries between data, ML, and AI engineering.
  • Tobias Macey notes unstructured data now must be prepared for probabilistic models, changing deterministic workflows.
Get the Snipd Podcast app to discover more snips from this episode
Get the app