
The New Stack Podcast: Meet Gravitino, a geo-distributed, federated metadata lake
Jan 29, 2026
Junping (JP) Du, founder and CEO of Datastrato and creator of Apache Gravitino, builds open-source, geo-distributed metadata infrastructure. He discusses Gravitino as a catalog-of-catalogs that unifies metadata across engines, its engine-neutral governance role, multimodal and cross-cloud support, v1.1 milestones, and the roadmap toward agent-native metadata and broader engine integration.
AI Snips
Catalog Of Catalogs Unifies Metadata
- Gravitino is designed as a single, engine-neutral control plane that unifies metadata across engines and clouds.
- This approach removes catalog silos and provides consistent governance for AI and traditional workloads.
Founders Built From Production Pain
- Junping Du and his colleague built Gravitino after years of working with Hadoop, Spark, cloud warehouses, and lakehouses.
- They encountered metadata silos, duplicated governance, and missing semantics in large data environments.
Metadata As The AI Control Plane
- Metadata must become the control plane to manage semantic layers, access, and governance for AI consumption.
- A neutral, unified metadata system enables decisions about which engine should access which data and under what rules.
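
To make the "catalog of catalogs" and control-plane ideas above concrete, here is a minimal toy sketch in Python. It is not Gravitino's API: the `MetadataLake` class, its methods, the catalog names, and the rule format are all hypothetical, chosen only to illustrate how one engine-neutral layer might federate several catalogs and answer "which engine may do what to which table".

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Rule:
    """Hypothetical access rule: one engine, one table, one action."""
    engine: str  # e.g. "spark" or "trino"
    table: str   # fully qualified name: catalog.schema.table
    action: str  # "read" or "write"


@dataclass
class MetadataLake:
    """Toy federated metadata layer (illustrative only, not a real API)."""
    # Maps each registered catalog name to the tables it exposes.
    catalogs: dict[str, set[str]] = field(default_factory=dict)
    # Allow-list of rules; anything not listed is denied.
    rules: set[Rule] = field(default_factory=set)

    def register_catalog(self, name: str, tables: list[str]) -> None:
        self.catalogs[name] = set(tables)

    def list_tables(self) -> list[str]:
        # One unified view across every underlying catalog.
        return sorted(
            f"{catalog}.{table}"
            for catalog, tables in self.catalogs.items()
            for table in tables
        )

    def allow(self, engine: str, table: str, action: str) -> None:
        self.rules.add(Rule(engine, table, action))

    def can_access(self, engine: str, table: str, action: str) -> bool:
        # Split "catalog.schema.table" into the catalog and the rest.
        catalog, _, rest = table.partition(".")
        if rest not in self.catalogs.get(catalog, set()):
            return False  # unknown catalog or table
        return Rule(engine, table, action) in self.rules


# Assumed example data: two catalogs in different regions.
lake = MetadataLake()
lake.register_catalog("hive_us", ["sales.orders"])
lake.register_catalog("iceberg_eu", ["ml.features"])
lake.allow("spark", "hive_us.sales.orders", "read")
```

In this sketch the same rule set is consulted regardless of which engine asks, which is the point of keeping governance in the metadata layer rather than in each engine's own catalog.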

