The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast

CHAPTER

When Do You Run Vacuum on Postgres Tables?

Tabular uses a Git-like model where you basically have a whole bunch of overlapping data and metadata trees. You can stage a commit without actually updating the current reference. We use that at Netflix for a pattern of integrated audits, which we call write-audit-publish. Once the audit passes, it would basically cherry-pick or fast-forward the main branch to that commit.
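To make the write-audit-publish idea concrete, here is a minimal in-memory sketch. It is a toy model, not the Iceberg or Tabular API: the names `Snapshot`, `Table`, `stage`, and `publish` are hypothetical, and each snapshot simply holds a full copy of the rows rather than overlapping data and metadata trees. It only illustrates the ordering described above: write a staged commit, audit it, and move the main reference only if the audit passes.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Snapshot:
    snapshot_id: int
    parent_id: Optional[int]
    rows: Tuple  # toy stand-in for the table's data and metadata trees


@dataclass
class Table:
    snapshots: Dict[int, Snapshot] = field(default_factory=dict)
    main: Optional[int] = None  # the "current reference" readers see
    next_id: int = 1

    def stage(self, new_rows: List) -> int:
        """Write: commit a snapshot on top of main WITHOUT moving main."""
        base = self.snapshots[self.main].rows if self.main is not None else ()
        snap = Snapshot(self.next_id, self.main, base + tuple(new_rows))
        self.snapshots[snap.snapshot_id] = snap
        self.next_id += 1
        return snap.snapshot_id

    def publish(self, snapshot_id: int, audit: Callable[[Tuple], bool]) -> bool:
        """Audit, then publish: fast-forward main only if the audit passes."""
        snap = self.snapshots[snapshot_id]
        if snap.parent_id != self.main:
            # A real system could cherry-pick here; this toy only fast-forwards.
            raise ValueError("main has moved since this snapshot was staged")
        if not audit(snap.rows):
            return False  # staged snapshot stays unpublished
        self.main = snapshot_id
        return True

    def current_rows(self) -> List:
        """What readers of the main branch see."""
        return list(self.snapshots[self.main].rows) if self.main is not None else []


def no_nulls(rows: Tuple) -> bool:
    """Example audit (an assumption for illustration): reject any null row."""
    return all(r is not None for r in rows)


if __name__ == "__main__":
    table = Table()
    good = table.stage([1, 2])
    print(table.current_rows())           # staged data is not visible yet
    print(table.publish(good, no_nulls))  # audit passes, main moves
    bad = table.stage([3, None])
    print(table.publish(bad, no_nulls))   # audit fails, main stays put
    print(table.current_rows())
```

The point of the design is that readers only ever follow `main`, so a staged snapshot that fails its audit is never visible to them.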
