The Data Exchange with Ben Lorica

Making Data Engineering Safe for Automation and Agents

12 snips
Nov 13, 2025
Ciro Greco, Co-founder and CEO of Bauplan, discusses revolutionizing data engineering by applying software principles like version control and transactional pipelines to data lakes. He highlights the unique challenges of data work, such as scale and fragmentation, and introduces a git-like branching model for enhanced reproducibility. Ciro emphasizes the importance of transactional guarantees, especially for automated agents, and advocates for a code-first approach to enable safe and efficient interactions with data platforms.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Apply Software Engineering Rigor To Data

  • Treat data engineering like software engineering to get reproducibility, rollbacks, and robustness at scale.
  • Software best practices let large teams collaborate without sacrificing application reliability.
INSIGHT

Make Data Changes Versioned And Reproducible

  • Keep data changes versioned and tied to code so every change is reproducible and auditable.
  • You must be able to travel back in time and reproduce deterministic states of the lake.
ADVICE

Enable Team Autonomy With Branches

  • Give product and analytics teams their own cloud branches to develop without central-team bottlenecks.
  • Monitor and roll back those branches so teams stay independent and safe.
Get the Snipd Podcast app to discover more snips from this episode
Get the app