The Data Engineering Show cover image

The Data Engineering Show

Revolutionizing Data Governance with DataStrato’s Unified Open Source Approach

Apr 8, 2025
Lisa Cao, Product Manager at DataStrato, dives into the world of data governance, sharing her expertise in AI/ML and open-source frameworks. The discussion highlights Apache Gravitino's unique capabilities, enabling unified governance across diverse data systems. They tackle the 'Push-Down Permission Management' model, essential for security, and the growing trend towards open ecosystems that prioritize flexibility. Lisa also emphasizes the importance of real-world tool adoption versus social media hype, keeping data engineers agile in a fast-paced landscape.
23:36

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Apache Gravitino serves as a unified metadata lake, enabling consistent data governance across various query engines and data catalog systems.
  • The innovative 'Push-Down Permission Management' model allows for enhanced security and streamlined access control across diverse data environments.

Deep dives

Understanding Gravitino and Its Capabilities

Gravitino is an Apache incubating project designed to provide a unified governance and security layer for data catalogs. It allows users to manage various data assets and supports the concept of a meta-catalog, which can inherit functionalities from other catalogs. The latest updates position Gravitino as a flexible solution that integrates not just with traditional models but also with newer formats like Iceberg, enhancing cross-system compatibility. This extensibility is crucial for organizations looking to streamline their data operations across different environments and technologies.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner