5min chapter

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

CHAPTER

Open Source Is the Way to Go, Right?

Parquet was designed in an era of mostly on-premise DFS, right, so you had to care a lot about storage space. But if you now don't care as much, would you do certain things differently? I'm not into it, but I'm pretty sure there's something better that can come out in the future. So what I really care about, again going back, is whether the services are open, right? Can you cluster a Snowflake table outside of Snowflake? If you don't buy that, maybe there is someone who can use AI and super-cluster your tables automatically, like one clustering algorithm. That's super interesting. Do you think this is good to change from data lakes usually
