Data Engineering Podcast cover image

Effective Pandas Patterns For Data Engineering

Data Engineering Podcast

00:00

Scaling Scale Out, Multicore Architectures?

Data quality and data provenades are definitely corps issues in building data engineering tools. The other thing that we've touched on briefly is this question of scaling from a single machine to saying, ok, now i need to process wou know, parabites or petabites of data. And so then you start to jump into these tool chains of dask or modin or ray or enof the calas library. I do hear a common complaint people ar like, why would i use pan das? It's like a single machine, so i'm a big data person. That doesn't make sense to use that.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app