Gnarly Data Waves by Dremio cover image

EP20 - What's New in the Apache Iceberg Project: Updates, PyIceberg, Compute Engines

Gnarly Data Waves by Dremio

00:00

Apache Iceberg's New Branching and Tagging Capabilities

Apache iceberg maintains the state of a particular table at a certain point in time using a concept called snapshot. With this new release, references can now be extended to include tagging and branching of the table. Iceberg's new branching and tagging capability borrows similar concept on the Git world and applies it to the world of data lake and lake house space. It enables data engineering and data science team to keep track of the state or version of an iceberg table using name references.

Play episode from 10:01
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app