
Data Engineering Podcast From Data Engineering to AI Engineering: Where the Lines Blur
68 snips
Dec 14, 2025 Explore the evolution of data engineering as it merges with AI. Discover how the transition from Hadoop to cloud warehouses has shaped current practices. Uncover the impact of LLMs and how unstructured data is revolutionizing information retrieval. Delve into operational demands, including uptime and latency, in customer-facing applications. Reflect on the need for collaboration, new testing practices, and a community approach to emerging AI workflows. This journey emphasizes adapting skills to a rapidly changing technological landscape.
AI Snips
Chapters
Transcript
Episode notes
Origin Of Data Engineering
- Data engineering arose to give data scientists reliable, cleaned data so models could be productive.
- Tobias Macey explains the role grew from warehousing and BI needs as datasets and hiring surged.
From Hadoop To Cloud Warehouses
- The shift from Hadoop to cloud warehouses changed how engineers structure and serve data.
- Tobias Macey highlights Snowflake, Redshift and columnar engines enabling repeatable, reliable pipelines.
Blurring Between Data And AI
- Generative AI and LLMs have blurred boundaries between data, ML, and AI engineering.
- Tobias Macey notes unstructured data now must be prepared for probabilistic models, changing deterministic workflows.
