Data Engineering Podcast cover image

The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast

00:00

How Do I Maintain My Iceberg Tables?

In Spark, you can actually choose copyright or merge on read for an operation. In Trino, there are stored procedures to help you do that. You know, rewrite data files, look for anything with more than three delete files. That's the approach that I think data architecture is moving to and we want to sort of lead that change so that data engineers don't think about this anymore, right? Where again, this is SQL behavior. How often in Postgres are you thinking, hmm, are things fragmented? I maybe run an optimized, like hopefully not daily or hourly.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app