Software Engineering Radio - the podcast for professional software developers

SE Radio 696: Flavia Saldanha on Data Engineering for AI

16 snips
Nov 25, 2025
Flavia Saldanha, a consulting data engineer and architect specializing in AI readiness, joins to discuss the evolution of data engineering. She highlights the shift from treating data as a service to a product, stressing the importance of ownership and context. Flavia explains modern lakehouse architectures and the integration of vector databases to manage unstructured data for AI. She emphasizes the need for data engineers to embrace product thinking, governance, and NLP skills, positioning AI as an augmenting co-pilot rather than a replacement.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Lakehouse Plus Vectors For AI

  • Stack evolved from monolithic warehouses to lakes to lakehouses augmented with vector databases.
  • Vector stores and embeddings enable unstructured data to join structured analytics for AI.
ADVICE

Build Core Platform Capabilities First

  • Invest in a data marketplace, observability, data-quality checks, and orchestration before scaling.
  • Use observability for freshness, quality tools for anomalies, and orchestration for reliable handoffs.
ADVICE

Embed Governance Early And Actively

  • Embed governance during product design via data contracts, classification, tokenization, and lineage.
  • Treat governance as a guiding principle, not a checklist at the end.
Get the Snipd Podcast app to discover more snips from this episode
Get the app