The New Stack Podcast

Meet Gravitino, a geo-distributed, federated metadata lake

Jan 29, 2026
Junping (JP) Du, founder and CEO of Datastrato and creator of Apache Gravitino, builds open-source, geo-distributed metadata infrastructure. He discusses Gravitino as a catalog-of-catalogs that unifies metadata across engines, its engine-neutral governance role, multimodal and cross-cloud support, v1.1 milestones, and the roadmap toward agent-native metadata and broader engine integration.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Catalog Of Catalogs Unifies Metadata

  • Gravitino is designed as a single, engine-neutral control plane that unifies metadata across engines and clouds.
  • This approach removes catalog silos and provides consistent governance for AI and traditional workloads.
ANECDOTE

Founders Built From Production Pain

  • Junping Du and his colleague built Gravitino from long experience with Hadoop, Spark, cloud warehouses, and lakehouses.
  • They encountered metadata silos, duplicated governance, and missing semantics in large data environments.
INSIGHT

Metadata As The AI Control Plane

  • Metadata must become the control plane to manage semantic layers, access, and governance for AI consumption.
  • A neutral, unified metadata system enables decisions about which engine should access which data and under what rules.
Get the Snipd Podcast app to discover more snips from this episode
Get the app