Effective Pandas Patterns For Data Engineering

Data Engineering Podcast

Scaling Out in Pandas

Tobias: If you're working with a data set in pandas, you probably want three to ten times as much memory as the data itself occupies, to leave room for overhead. Pandas often has two or three ways to do the same thing. Sometimes that's helpful, but which one works best can depend on how big your data is. He says he hasn't had much experience with scale-out systems such as Dask or Spark. Tobias: One thing I have seen is that Python has this thing called the Zen of Python. And the Zen of Python says there should be one, and preferably only one, obvious way to do things.
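
The three-to-ten-times rule of thumb above can be turned into a rough check. The following is a minimal sketch, not something shown in the episode: the helper name `memory_headroom` and the sample DataFrame are made up for illustration, and the multipliers are simply the 3x and 10x figures mentioned in the summary.

```python
import pandas as pd


def memory_headroom(df, low=3, high=10):
    """Return (data_bytes, low_bytes, high_bytes) for a DataFrame.

    Hypothetical helper: low/high are the 3x and 10x rule-of-thumb
    multipliers from the summary, not values defined by pandas.
    """
    # deep=True also counts the contents of object (e.g. string) columns
    data_bytes = int(df.memory_usage(deep=True).sum())
    return data_bytes, data_bytes * low, data_bytes * high


# Small made-up frame, only to have something to measure
df = pd.DataFrame({"id": range(1000), "value": [0.5] * 1000})

data_bytes, low_bytes, high_bytes = memory_headroom(df)
print(f"data: {data_bytes} bytes")
print(f"suggested working memory: {low_bytes} to {high_bytes} bytes")
```

Comparing the upper figure with the RAM available on the machine gives a quick sense of whether a data set is likely to fit comfortably in pandas on a single node.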
