120: Materialize Origins: A Timely Dataflow Story with Arjun Narayan and Frank McSherry

The Data Stack Show

How to Migrate Views From Hive to Any System in an Automated Way

The architecture as it is right now, if I understand correctly, there's a completely serverless-like experience. Within the clusters you create replicas; these are the executors, if you will. The cluster is where you aim the queries or the views that you'd like to maintain, and then you provision them with bits of resources. You can either build materialized views or you can build indexes on views. One of them gets sort of sunk back out to S3, and one lives in memory in index form. So you might create cluster prod, create cluster test, create cluster interns. And the interns at the same time are doing like 10-way cross joins on data that they shouldn't have
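
As a rough illustration of the prod / test / interns setup described above, here is a toy Python model. It is only a sketch and not Materialize's API. The `Cluster` class, the `maintain` method, the abstract `capacity` units, and the view names are all hypothetical. The assumption that each cluster has its own resource budget is our reading of "you provision them with bits of resources", not something the excerpt states outright.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """Toy stand-in for a cluster: a named pool of compute with its own budget."""
    name: str
    capacity: int  # hypothetical abstract resource units, not a real size
    used: int = 0
    maintained: list = field(default_factory=list)  # views and indexes placed here

    def maintain(self, obj: str, cost: int) -> bool:
        """Place a view or index on this cluster; refuse if it exceeds the budget."""
        if self.used + cost > self.capacity:
            return False
        self.used += cost
        self.maintained.append(obj)
        return True


# One cluster per audience, each provisioned separately (assumed sizes).
clusters = {
    name: Cluster(name, capacity=cap)
    for name, cap in [("prod", 100), ("test", 20), ("interns", 10)]
}

# A production view fits comfortably in the prod budget.
prod_ok = clusters["prod"].maintain("orders_by_region", cost=40)

# An oversized cross join only runs into the interns' own budget.
interns_ok = clusters["interns"].maintain("ten_way_cross_join", cost=500)
```

In this model the oversized query is rejected by the `interns` cluster alone, and the `prod` cluster's usage is unchanged, which is one plausible reason to create separate clusters in the first place.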
