The Data Stack Show cover image

99: State of the Data Lakehouse with Vinoth Chandar of Apache Hudi

The Data Stack Show

00:00

Is Clustering Really a Data Warehouse?

Clustering changes how you start how you actually back records into fights. This fundamentally affects your compute dollars and it can reduce dramatically reduced cost for your lake. And then the fifth one has to do with what you call like plastering, which is more about like how you can optimize like on a lower level, the data of how it's stored. All right. So that's, that's amazing. My question is, and going back like to the initial question, these are like, let's say additional services that the data lake meets in order to rival a data warehouse. How do we change that? Because not everyone wants to become like a database engineer, right? In order

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app