5min chapter

How he carved his own path at Airbnb, from data engineer to CEO of Mage - Tommy Dang - the data scientist show #056

The Data Scientist Show

CHAPTER

Airflow - Data Warehouse Scaling Made Simple

Airflow is a tool that lets you orchestrate and see all your pipelines. We also integrate natively with Spark so that you can handle large, large data sets. And then finally, we treat data as a first-class citizen. There are tools like Airflow that are great at orchestrating generic workloads. You can use them for anything. But we just wanted to focus only on data: data workloads, data pipelines, integrating data, transforming data. Every step in your pipeline actually produces a data product.

So although this tool has a lot of functionality, what if I only want to use a few features, or I want to customize a feature? Yes.
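To make the idea that every step in a pipeline produces a data product more concrete, here is a minimal sketch in plain Python. It is an illustration only: `run_pipeline`, the step functions, and the sample rows are all hypothetical names invented for this example, and none of it is the API of Mage, Airflow, or Spark.

```python
from typing import Any, Callable

# Hypothetical illustration; not the API of Mage, Airflow, or Spark.
Step = Callable[[Any], Any]


def run_pipeline(steps: list[tuple[str, Step]], initial: Any = None) -> dict[str, Any]:
    """Run steps in order and keep each step's output under its name."""
    products: dict[str, Any] = {}
    data = initial
    for name, step in steps:
        data = step(data)      # each step receives the previous step's output
        products[name] = data  # and its own output is kept as a named data product
    return products


def load(_: Any) -> list[dict]:
    # Stand-in for reading from a source system (made-up sample rows).
    return [
        {"user": "a", "amount": 10},
        {"user": "b", "amount": -3},
        {"user": "a", "amount": 5},
    ]


def clean(rows: list[dict]) -> list[dict]:
    # Drop rows with non-positive amounts.
    return [row for row in rows if row["amount"] > 0]


def total_by_user(rows: list[dict]) -> dict[str, int]:
    # Aggregate the cleaned rows per user.
    totals: dict[str, int] = {}
    for row in rows:
        totals[row["user"]] = totals.get(row["user"], 0) + row["amount"]
    return totals


products = run_pipeline(
    [("load", load), ("clean", clean), ("total_by_user", total_by_user)]
)
print(products["clean"])
print(products["total_by_user"])
```

Because the intermediate outputs are kept rather than thrown away, each one can be inspected on its own, which is one way to read "data as a first-class citizen" in contrast to an orchestrator that only tracks whether a task succeeded.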
