How Do I Maintain My Iceberg Tables?

In Spark, you can actually choose copy-on-write or merge-on-read for an operation. In Trino, there are stored procedures to help you do that. You know, rewrite data files, look for anything with more than three delete files. That's the approach that I think data architecture is moving to, and we want to sort of lead that change so that data engineers don't think about this anymore, right? Where again, this is SQL behavior. How often in Postgres are you thinking, hmm, are things fragmented? I maybe run an optimize, like, hopefully not daily or hourly.
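As a rough illustration of the two knobs mentioned here, the sketch below builds the Spark SQL one might run against an Iceberg table: one statement that sets the row-level write mode through the `write.delete.mode`, `write.update.mode`, and `write.merge.mode` table properties, and one call to the `rewrite_data_files` procedure with its `delete-file-threshold` option. The catalog and table names (`demo`, `db.events`) and the helper function names are hypothetical, and the threshold of 4 is only one reading of "more than three delete files", assuming the option is a minimum count.

```python
# Sketch only: builds Spark SQL strings for Iceberg table maintenance.
# Catalog and table names passed in by the caller are hypothetical placeholders.

VALID_MODES = ("copy-on-write", "merge-on-read")


def write_mode_sql(table: str, mode: str) -> str:
    """Return an ALTER TABLE statement that sets the row-level write mode
    for delete, update, and merge operations on an Iceberg table."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown write mode: {mode}")
    props = ", ".join(
        f"'write.{op}.mode'='{mode}'" for op in ("delete", "update", "merge")
    )
    return f"ALTER TABLE {table} SET TBLPROPERTIES ({props})"


def compaction_sql(catalog: str, table: str, delete_file_threshold: int) -> str:
    """Return a CALL statement for Iceberg's rewrite_data_files procedure.

    Assumption: delete-file-threshold is the minimum number of delete files
    attached to a data file before that file is picked up for rewriting.
    """
    return (
        f"CALL {catalog}.system.rewrite_data_files("
        f"table => '{table}', "
        f"options => map('delete-file-threshold', '{delete_file_threshold}'))"
    )


if __name__ == "__main__":
    # With a SparkSession configured for an Iceberg catalog, each string
    # could be passed to spark.sql(...). Here they are only printed.
    print(write_mode_sql("demo.db.events", "merge-on-read"))
    print(compaction_sql("demo", "db.events", 4))
```

Running the compaction call on a schedule is the manual version of what the speaker hopes engines will eventually handle on their own.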

Play episode from 25:33

The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast
