70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

Data Lakes - What's the Difference?

Uber is looking to merge its data lake and Delta Lake architecture. The two teams inside the company don't like to use completely different stacks for their work. Data lakes are about high-throughput writes; these transactions are, in database terms, very large transactions, so you cannot really afford to have one of them fail. And then you could see a lot more on-demand compute and all this cloud. We took a very [different] approach because we were focused on streaming CDC data in all of those incremental use cases.
