
Software Engineering Radio - the podcast for professional software developers SE Radio 696: Flavia Saldanha on Data Engineering for AI
16 snips
Nov 25, 2025 Flavia Saldanha, a consulting data engineer and architect specializing in AI readiness, joins to discuss the evolution of data engineering. She highlights the shift from treating data as a service to a product, stressing the importance of ownership and context. Flavia explains modern lakehouse architectures and the integration of vector databases to manage unstructured data for AI. She emphasizes the need for data engineers to embrace product thinking, governance, and NLP skills, positioning AI as an augmenting co-pilot rather than a replacement.
AI Snips
Chapters
Transcript
Episode notes
Lakehouse Plus Vectors For AI
- Stack evolved from monolithic warehouses to lakes to lakehouses augmented with vector databases.
- Vector stores and embeddings enable unstructured data to join structured analytics for AI.
Build Core Platform Capabilities First
- Invest in a data marketplace, observability, data-quality checks, and orchestration before scaling.
- Use observability for freshness, quality tools for anomalies, and orchestration for reliable handoffs.
Embed Governance Early And Actively
- Embed governance during product design via data contracts, classification, tokenization, and lineage.
- Treat governance as a guiding principle, not a checklist at the end.
