The Data Stack Show cover image

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

00:00

Stream Processing and Databases in the Data Lake

We borrowed a lot from all the databases as well. Like indexes for example, we have an interesting problem for CDC right. If you so okay you have an upstream like Oracle or Cassandra some oil DP databases taking rights. And I invested a lot and more sort of like so this problem is similar to running a flink job reading from like a Kafka CDC and then updating a state show essentially stream processing principle. We are trying to add more capabilities on the lake than even a typical warehouse today.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app