The Real Python Podcast

Packaging Data Analyses & Using pandas GroupBy

Aug 16, 2024
Christopher Trudeau, a data analysis expert, returns alongside Joshua Cook, an author experienced in Python project organization. They discuss best practices for structuring data analyses with a focus on modern packaging techniques. Trudeau shares insights from his new video course on using pandas GroupBy for effective data manipulation and aggregation. The duo also touches on recent Python community resources, including advancements in package management and strategies for effective data practices.
ANECDOTE

Setuptools Test Command Removal

  • Setuptools briefly removed its "test" command, which broke many builds that still depended on it.
  • Maintainers quickly restored it after community outcry, which shows how fragile packaging workflows can be.
INSIGHT

Packaging Data Analyses

  • Data science projects benefit from treating analyses like packages.
  • This approach improves organization and portability, especially for complex projects.
ADVICE

Project Best Practices

  • Create a module for all paths in your project for better organization.
  • Use enums for consistency and type hints for clarity, reducing repetition.
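The advice above can be sketched as a single small module. This is only an illustration under stated assumptions, not code from the episode: the `my_analysis` root, the `data/raw` and `data/processed` layout, the `Dataset` members, and the helper names `raw_path` and `processed_path` are all hypothetical.

```python
from enum import Enum
from pathlib import Path


class Dataset(Enum):
    """Hypothetical dataset names; each value doubles as a file stem."""

    SALES = "sales"
    CUSTOMERS = "customers"


# Hypothetical layout: every path in the project is defined once, here.
PROJECT_ROOT = Path("my_analysis")
RAW_DIR = PROJECT_ROOT / "data" / "raw"
PROCESSED_DIR = PROJECT_ROOT / "data" / "processed"


def raw_path(dataset: Dataset) -> Path:
    """Return the raw CSV path for a dataset."""
    return RAW_DIR / f"{dataset.value}.csv"


def processed_path(dataset: Dataset) -> Path:
    """Return the processed Parquet path for a dataset."""
    return PROCESSED_DIR / f"{dataset.value}.parquet"
```

With this in place, a notebook or script would call `raw_path(Dataset.SALES)` instead of repeating a string literal. A misspelled dataset name then fails as an `AttributeError` on the enum rather than as a missing file later on, and the type hints tell editors and type checkers what each helper accepts and returns.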