The Real Python Podcast

Improving Classification Models With XGBoost

57 snips
Aug 25, 2023
Author and Python trainer Matt Harrison discusses his new book on improving classification models with XGBoost. He emphasizes the importance of exploratory data analysis and provides tools to explain models to stakeholders. The podcast also covers the popularity of XGBoost, the concept of prediction in data science, and the application of XGBoost in classification models.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Book Born From Trainer Frustration

  • Matt Harrison wrote the book to scratch his own itch after corporate training showed gaps in existing XGBoost resources.
  • He built a more actionable, hands-on guide because available material lacked the practical depth he wanted.
INSIGHT

What Tabular Prediction Really Means

  • Tabular prediction means supervised learning on structured rows and columns like spreadsheets or databases.
  • It contrasts with unstructured data such as images, audio, or video which lack a fixed tabular shape.
ADVICE

Prepare Your Environment First

  • Know Python and basic tooling like Jupyter and pandas before using the book's XGBoost examples.
  • Use Jupyter for interactive exploration but handle preprocessing and cleaning outside the book's focus.
Get the Snipd Podcast app to discover more snips from this episode
Get the app