Version Your Data Lakehouse Like Your Software With Nessie

Data Engineering Podcast

Exploring Nessie: A Git-like Versioned Catalog for Data Lakes

In this chapter, Tobias Macey interviews Alex Merced, a developer advocate at Dremio, about Project Nessie, a Git-like versioned catalog for data lakes that use Apache Iceberg. They discuss Nessie's core functions, how it compares with lakeFS, its role in data lakehouse environments, and its versioning capabilities, including its integration with Apache Iceberg and maintenance tasks such as pruning old versions and running table compactions.
