Data Engineering Podcast cover image

The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast

CHAPTER

How Do I Maintain My Iceberg Tables?

In Spark, you can actually choose copyright or merge on read for an operation. In Trino, there are stored procedures to help you do that. You know, rewrite data files, look for anything with more than three delete files. That's the approach that I think data architecture is moving to and we want to sort of lead that change so that data engineers don't think about this anymore, right? Where again, this is SQL behavior. How often in Postgres are you thinking, hmm, are things fragmented? I maybe run an optimized, like hopefully not daily or hourly.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner