Data Engineering Podcast cover image

Effective Pandas Patterns For Data Engineering

Data Engineering Podcast

00:00

Scaling Out in Pandas

Tobias: If you're playing with a data set in pandas, you probably want three to ten x the amount of memory for some overhead. Pandas often has two or three ways to do something and sometimes it's good, but sometimes that might be dependent on how big your data is. He says he hasn't had much experience with scale out systems such as dask nor spark. tobias: One thing i have seen is that python has this thing called a xeno python. And xenopython said there should be one, and preferably only one, way to do things.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app