
Software Engineering Radio - the podcast for professional software developers SE Radio 696: Flavia Saldanha on Data Engineering for AI
Nov 25, 2025
Flavia Saldanha, a consulting data engineer and architect specializing in AI readiness, joins to discuss the evolution of data engineering. She highlights the shift from treating data as a service to a product, stressing the importance of ownership and context. Flavia explains modern lakehouse architectures and the integration of vector databases to manage unstructured data for AI. She emphasizes the need for data engineers to embrace product thinking, governance, and NLP skills, positioning AI as an augmenting co-pilot rather than a replacement.
AI Snips
Chapters
Transcript
Episode notes
From Data Mover To Product Owner
- Data engineering evolved from moving data (ETL) to treating data as a product with ownership and trust.
- Product thinking forces teams to embed business context and accountability into data pipelines.
Assign Clear Data Product Ownership
- Design dedicated data product owners to own SLAs, semantics, and consumer trust.
- Embed accountability so consumers know the source of truth and whom to contact.
Lakehouse Plus Vectors For AI
- Stack evolved from monolithic warehouses to lakes to lakehouses augmented with vector databases.
- Vector stores and embeddings enable unstructured data to join structured analytics for AI.
