Data Engineering Podcast cover image

The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast

00:00

When Do You Run Vacuum on Postgres Tables?

Tabular uses a Git-like model where you basically have a whole bunch of data and metadata trees that are overlapping. You can stage a commit without actually updating the current reference. So we use that at Netflix to do this pattern of sort of integrated audits or write audit publish, we call it. It would basically cherry pick or fast forward the main branch to that commit.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app