70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

Is There a Gap in Lakehouse Architecture?

The lake has no servers for either data or metadata today. The way things have evolved with Delta Lake or Iceberg, where you stick metadata into a file, is not going to be as performant compared to what, let's say, Snowflake does, which is keep metadata in an actual horizontally scalable OLTP database like FoundationDB, for example. We are trying to think of a model where we have servers for metadata and we keep the data plane kind of serverless, where Spark jobs should be able to access the raw data directly.
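The contrast described in the snippet can be illustrated with a small toy model. The sketch below is not how Hudi, Delta Lake, Iceberg, or Snowflake store metadata; the class names `FileBasedTable` and `MetadataServer`, the commit entry layout, and the file names are all hypothetical. It only shows the general idea: with metadata kept as files, a reader replays a commit log to find the live data files, while a metadata server can answer the same question with one lookup and leave the reader to fetch the raw data files directly.

```python
from dataclasses import dataclass, field


@dataclass
class FileBasedTable:
    """Toy model: metadata kept as a log of commit entries beside the data.

    Each entry lists data files added and removed by one commit (hypothetical
    layout). A reader must replay the log to learn the current file listing.
    """
    commit_log: list = field(default_factory=list)

    def commit(self, add=(), remove=()):
        self.commit_log.append({"add": list(add), "remove": list(remove)})

    def current_files(self):
        files = set()
        for entry in self.commit_log:  # work grows with the number of commits
            files.update(entry["add"])
            files.difference_update(entry["remove"])
        return sorted(files)


@dataclass
class MetadataServer:
    """Toy stand-in for a metadata service backed by a scalable database.

    The live file listing is updated on every commit, so a reader gets it
    with a single lookup and then reads the data files itself.
    """
    live_files: dict = field(default_factory=dict)  # table name -> set of paths

    def commit(self, table, add=(), remove=()):
        files = self.live_files.setdefault(table, set())
        files.update(add)
        files.difference_update(remove)

    def current_files(self, table):
        return sorted(self.live_files.get(table, set()))


if __name__ == "__main__":
    lake = FileBasedTable()
    lake.commit(add=["part-0.parquet", "part-1.parquet"])
    lake.commit(add=["part-2.parquet"], remove=["part-0.parquet"])

    server = MetadataServer()
    server.commit("trips", add=["part-0.parquet", "part-1.parquet"])
    server.commit("trips", add=["part-2.parquet"], remove=["part-0.parquet"])

    # Both approaches agree on the listing; they differ in how it is obtained.
    print(lake.current_files())
    print(server.current_files("trips"))
```

In both cases the data plane stays "serverless" in the sense used above: the listing is only a set of paths, and a job such as a Spark reader would open those files directly from storage.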
