Version Your Data Lakehouse Like Your Software With Nessie

Data Engineering Podcast

Comparing Nessie and lakeFS in Data Ecosystems

Nessie and lakeFS version data at different levels. Nessie captures metadata changes: when an Iceberg table is updated with an insert, the operation can create multiple new files, and Nessie records the resulting change to the table's metadata rather than tracking each file. lakeFS captures deltas in the actual files, adding and subtracting files to reflect changes in the data. Both projects emerged around the same time and initially aimed to use Git semantics, but each realized that the nature and volume of data changes called for a different abstraction.
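The difference between the two abstractions can be illustrated with a toy model. The sketch below is not the Nessie or lakeFS API; the class names, function names, table name, and file paths are all hypothetical and chosen only for illustration. It models a single insert that writes two data files and one new table metadata file, then shows what each style of commit would record.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetadataCommit:
    """Metadata change capture: one entry per table, pointing at its new metadata."""
    table: str
    metadata_location: str  # hypothetical path to the table's current metadata file


@dataclass(frozen=True)
class FileDeltaCommit:
    """File delta capture: the sets of files added and removed by a change."""
    added: frozenset
    removed: frozenset


def metadata_commit(table: str, new_metadata_location: str) -> MetadataCommit:
    # Only the pointer to the table's metadata changes, however many
    # data files the insert produced.
    return MetadataCommit(table, new_metadata_location)


def file_delta_commit(before: set, after: set) -> FileDeltaCommit:
    # Compare the file listings before and after the change.
    return FileDeltaCommit(frozenset(after - before), frozenset(before - after))


# Hypothetical file listing before and after an insert into an "orders" table.
files_before = {"orders/data/a.parquet", "orders/metadata/v1.json"}
files_after = files_before | {
    "orders/data/b.parquet",
    "orders/data/c.parquet",
    "orders/metadata/v2.json",
}

meta = metadata_commit("orders", "orders/metadata/v2.json")
delta = file_delta_commit(files_before, files_after)
```

In this toy model the metadata-style commit holds a single pointer regardless of how many files the insert wrote, while the file-delta commit grows with the number of files touched.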
