The Data Stack Show cover image

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

00:00

The Story of Hoodie Inside of Uber

Hoodie was created to help Uber deal with the huge amount of data that they were dealing with. The company had a typical on-prem data warehouse, but couldn't fit all its volumes into it. So hoodie built out how to data lake most people did and used stream processing systems like CDC change capture. Hoodie is now being trialled in more than 100 cities around the U.S., including some new ones this week.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app