5min chapter

The Data Stack Show cover image

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

CHAPTER

The Story of Hoodie Inside of Uber

Hoodie was created to help Uber deal with the huge amount of data that they were dealing with. The company had a typical on-prem data warehouse, but couldn't fit all its volumes into it. So hoodie built out how to data lake most people did and used stream processing systems like CDC change capture. Hoodie is now being trialled in more than 100 cities around the U.S., including some new ones this week.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode