
70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi
The Data Stack Show
Stream Processing and Databases in the Data Lake
We borrowed a lot from databases as well. Take indexes, for example. We have an interesting problem for CDC, right? Say you have an upstream database like Oracle or Cassandra, some OLTP database taking writes. And I invested a lot, and more, sort of like... So this problem is similar to running a Flink job that reads from a Kafka CDC stream and then updates a state store: essentially a stream processing principle. We are trying to add more capabilities on the lake than even a typical warehouse has today.
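
To make the analogy concrete, here is a minimal Python sketch of applying a batch of CDC events to a keyed table with the help of an index. It is not Hudi's or Flink's actual implementation; `ChangeEvent`, `IndexedTable`, and the `file-N` naming are hypothetical names made up for illustration, and the "files" are plain in-memory dictionaries.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ChangeEvent:
    """One change captured from an upstream database (hypothetical shape)."""
    op: str                     # "upsert" or "delete"
    key: str                    # record key from the upstream table
    row: Optional[dict] = None  # new row image; unused for deletes


class IndexedTable:
    """Toy lake table: rows live in 'files', and an index maps key -> file."""

    def __init__(self):
        self.files = {}      # file_id -> {key: row}
        self.index = {}      # key -> file_id
        self._next_file = 0

    def apply(self, events):
        """Apply one batch of change events, like one streaming micro-batch."""
        new_file = None  # file that receives keys first seen in this batch
        for ev in events:
            file_id = self.index.get(ev.key)  # index lookup instead of a scan
            if ev.op == "delete":
                if file_id is not None:
                    del self.files[file_id][ev.key]
                    del self.index[ev.key]
            elif ev.op == "upsert":
                if file_id is None:
                    # Unknown key: route it to this batch's new file.
                    if new_file is None:
                        new_file = f"file-{self._next_file}"
                        self._next_file += 1
                        self.files[new_file] = {}
                    file_id = new_file
                    self.index[ev.key] = file_id
                # Known key: overwrite the row in the file that holds it.
                self.files[file_id][ev.key] = ev.row
            else:
                raise ValueError(f"unknown op: {ev.op}")

    def snapshot(self):
        """Return the current state of the table as key -> row."""
        return {k: self.files[f][k] for k, f in self.index.items()}


table = IndexedTable()
table.apply([
    ChangeEvent("upsert", "a", {"v": 1}),
    ChangeEvent("upsert", "b", {"v": 1}),
])
table.apply([
    ChangeEvent("upsert", "a", {"v": 2}),  # update goes to the existing file
    ChangeEvent("delete", "b"),
    ChangeEvent("upsert", "c", {"v": 1}),  # insert goes to a new file
])
print(table.snapshot())
```

The index plays the role of the state lookup in a stream processor: each incoming change finds the record it affects by key, so an update touches only the file that holds that record rather than rewriting the whole table.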