
120: Materialize Origins: A Timely Dataflow Story with Arjun Narayan and Frank McSherry
The Data Stack Show
00:00
How to Migrate Views From Hive to Any System in an Automated Way
The architecture as it is right now, if I understand correctly, there's a completely serverless like experience. Within the clusters you create replicas. These are the executors. If you will, the cluster is where you aim the query or in the views that you like to maintain and then you provision them with bits of resources. You can either build material as user. You can build indexes on view. One of them gets sort of sunk back out to S three and one and lives in memory index form. So you might create cluster prod. Create cluster test. Create cluster interns. And the interns at the same time are doing like 10 way cross joins on data that they shouldn't have
Transcript
Play full episode