The Data Engineering Show

Revolutionizing Data Governance with DataStrato’s Unified Open Source Approach

Apr 8, 2025
Lisa Cao, Product Manager at DataStrato, discusses data governance, drawing on her experience with AI/ML and open-source frameworks. The conversation highlights Apache Gravitino's ability to provide unified governance across diverse data systems. It covers the push-down permission management model, which is essential for security, and the growing trend toward open ecosystems that prioritize flexibility. Lisa also stresses judging tools by real-world adoption rather than social media hype, which helps data engineers stay agile in a fast-moving landscape.
INSIGHT

Gravitino's Unified Approach

  • Gravitino, similar to Unity Catalog and Polaris, offers a unified data governance layer.
  • It supports multiple catalog systems, including Iceberg, Hive Metastore, and JDBC, which distinguishes it from those alternatives.
INSIGHT

Model Catalog Integration

  • Gravitino's model catalog integrates AI/ML and big data, enabling basic model versioning.
  • It allows tagging and linking models with training datasets, offering an end-to-end view.
INSIGHT

Push-Down Permission Management

  • Gravitino uses a push-down permission management model, simplifying governance across systems.
  • It translates unified RBAC policies into the native permissions of lower-level systems, such as AWS IAM policies or MySQL privileges.
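
The push-down idea can be sketched in a few lines of Python. This is a minimal illustration only, not Gravitino's API: the `Grant` class, the `to_mysql` and `to_iam_policy` helpers, the "read"/"write" privilege names, and the role and resource names are all hypothetical. It assumes one unified grant is rendered into each backend's native form, here a MySQL `GRANT` statement and an AWS IAM policy document for S3 objects.

```python
from dataclasses import dataclass

# Hypothetical unified privilege names mapped to each backend's native vocabulary.
MYSQL_PRIVILEGES = {"read": "SELECT", "write": "INSERT, UPDATE, DELETE"}
S3_ACTIONS = {"read": ["s3:GetObject"], "write": ["s3:PutObject", "s3:DeleteObject"]}


@dataclass(frozen=True)
class Grant:
    """One unified RBAC grant: a role may use a privilege on a resource."""
    role: str
    privilege: str  # "read" or "write" (illustrative names)
    resource: str   # "db.table" for MySQL, "bucket/prefix" for S3


def to_mysql(grant: Grant) -> str:
    """Render the grant as a MySQL GRANT statement targeting a role."""
    privileges = MYSQL_PRIVILEGES[grant.privilege]
    return f"GRANT {privileges} ON {grant.resource} TO '{grant.role}';"


def to_iam_policy(grant: Grant) -> dict:
    """Render the grant as an AWS IAM policy document for S3 objects."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": S3_ACTIONS[grant.privilege],
                "Resource": [f"arn:aws:s3:::{grant.resource}/*"],
            }
        ],
    }


# The same unified grant, pushed down to two different systems.
print(to_mysql(Grant("analyst", "read", "sales.orders")))
print(to_iam_policy(Grant("analyst", "read", "lake/sales")))
```

The point of the design is that administrators define the grant once in the unified layer, while enforcement stays with each underlying system's own access controls.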