Effective Pandas Patterns For Data Engineering

Data Engineering Podcast

Scaling Scale Out, Multicore Architectures?

Data quality and data provenance are definitely core issues in building data engineering tools. The other thing that we've touched on briefly is this question of scaling from a single machine to saying, OK, now I need to process, you know, terabytes or petabytes of data. And so then you start to jump into these tool chains of Dask or Modin or Ray or even the Koalas library. I do hear a common complaint where people are like, why would I use pandas? It's, like, single-machine, and I'm a big data person, so it doesn't make sense to use that.
