Data Engineering Podcast cover image

Data Engineering Podcast

Advanced Lakehouse Management With The LakeKeeper Iceberg REST Catalog

Apr 21, 2025
Victor Kessler, co-founder of Vakama and developer of Lakekeeper, dives into the world of advanced lakehouse management with a focus on Apache Iceberg. He discusses the pivotal role of metadata in data actionability and the evolution of data catalogs. Victor highlights innovative features of Lakekeeper, like integration with OpenFGA for access control and its deployment using Rust on Kubernetes. He also addresses the challenges of migrating data catalogs and the importance of community involvement in open-source projects for better data management.
57:13

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Lakekeeper, an Apache Iceberg REST catalog, is crucial for managing metadata, storage, and compute components in lake houses.
  • The integration of OpenFGA with Lakekeeper enables improved access control by facilitating centralized and granular permissions for data management.

Deep dives

AI-Powered Data Migration

Data migrations can often take months or even years, leading to resource strain and diminished team morale. An innovative AI-powered migration agent from Datafold aims to expedite this process, enabling migrations to be completed up to ten times faster than traditional manual methods. This agent utilizes both AI code translation and automated data validation to ensure seamless transitions. The solution's reliability is underscored by a guarantee of timely completion in writing, addressing a significant pain point in data management.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner