AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Orchestration Layer for Data Lake Management
The orchestration layer in dagster can clone the snowflake scheme to support new changes, managing test environments and data workflows. Different vendors and tools offer varying levels of support, with some requiring custom IO managers. The orchestration layer's built-in capability allows it to manage all test environments, making it ideal for data lake use cases. Additionally, Slike Lake FS supports branch and merge style workflows for data in S3.
The current stage of evolution in the data management ecosystem has resulted in domain and use case specific orchestration capabilities being incorporated into various tools. This complicates the work involved in making end-to-end workflows visible and integrated. Dagster has invested in bringing insights about external tools’ dependency graphs into one place through its "software defined assets" functionality. In this episode Nick Schrock discusses the importance of orchestration and a central location for managing data systems, the road to Dagster’s 1.0 release, and the new features coming with Dagster Cloud’s general availability.
The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode