The Data Stack Show

171: Machine Learning Pipelines Are Still Data Pipelines with Sandy Ryza of Dagster

Jan 3, 2024
Guest Sandy Ryza, an expert in machine learning pipelines, discusses the role of orchestrators in the lifecycle of data, changes in data ops and MLOps, data cleaning, and the overview of Dagster. They also explore the difference between data assets and tasks in data pipelines, defining lineage and data assets in Dagster, and the benefits of a unified orchestration framework. Additionally, they touch on orchestration in the development phase and the emergence of the analytics engineer role.
Ask episode
Chapters
Transcript
Episode notes