

The Test Set by Posit
Posit, PBC
A Posit podcast for data science junkies, anomaly hunters, and those who play outside the confidence interval. Hosted by Michael Chow, with co-hosts Wes McKinney & Hadley Wickham.
Episodes
Mentioned books

Sep 25, 2025 • 1h 7min
Michael Chow: From psychology and Python to constrained creativity
For this episode, we turn the mic around. Wes McKinney takes over the interviewer’s chair to chat with his co-host, Michael Chow. Michael’s a principal software engineer at Posit, but he started out studying how people think — literally, with a PhD in cognitive psychology. Somewhere along the way, he got hooked on data science, helped build adaptive learning tools at DataCamp, and now spends his days thinking about how to make Python easier to use and more fun.The two dig into what drives Michael’s curiosity, how a “weird obsession with tables” turned into a beloved open source project, and the future of data science/scientists.Episode Notes:We explore Michael’s path from studying the mind to shaping the Python data science ecosystem. From adaptive learning platforms to Great Tables, Michael shares how following unexpected curiosities can spark tools and communities that last.What’s Inside:Michael’s pivot from an academic career in data scienceBehind-the-scenes messiness of building data and learning platformsOpen source projects born out of zany, single-minded passionsBringing beauty to rows and columnsBig-picture thoughts on where data science — and open source tooling — are headed

Aug 26, 2025 • 45min
Roger Peng: Sustaining data science — in classrooms, code, and conversations
Roger Peng, a Professor of Statistics and Data Science at UT Austin and co-host of Not So Standard Deviations, discusses his unique journey in data science. He shares insights from his early projects that shaped his passion for R and the importance of hands-on experience in education. The conversation dives into the dynamics of podcasting and how to maintain meaningful content over time. Roger also emphasizes the evolving roles of programming languages like R and SQL, and the community's pivotal role in shaping the landscape of data science.

Aug 11, 2025 • 55min
Mine Çetinkaya-Rundel: Teaching in the AI era — and keeping students engaged
Mine Çetinkaya-Rundel, a data science educator at Duke University and Posit, shares her compelling journey from actuarial science to the classroom. She discusses her innovative teaching philosophy, the 'whole game' approach that keeps students engaged. The conversation dives into her use of AI, specifically LLMs, for instant feedback on assignments. Mine also addresses the importance of manual coding skills in the age of AI and reflects on the unique relationship between the R and Python communities in fostering collaboration and open-source learning.

Jul 29, 2025 • 27min
Wes McKinney: Part 2 — The open source hustle and an insider view of Positron
In the latest discussion, Wes McKinney, an open source software developer and creator of pandas, delves into the intricacies of maintaining open source projects. He explains the essential role of corporate support and shares the origin story of Apache Arrow. The conversation highlights the development of Positron, a cutting-edge IDE, and the importance of interoperable tools across programming languages. Wes also touches on how modern IDEs should cater to both users and machine learning models, blending his coding insights with a passion for music.

Jul 14, 2025 • 23min
Wes McKinney: Part 1 — Building Pandas, Arrow, and a speedrunning legacy
Wes McKinney, an influential open-source software developer and the mastermind behind Pandas and Apache Arrow, shares his journey from speedrunning communities to revolutionizing data processing. He discusses how frustration with data work sparked Pandas' creation and dives into the early days of organizing the GoldenEye speedrunning scene. Wes also reflects on the importance of community in open-source development, the challenges of optimizing CSV handling, and the thrill of collaborating with like-minded creators in the data space.

Jun 30, 2025 • 28min
Spreadsheets, bikes, and the accidental empire of R packages — with Hadley Wickham
Hadley Wickham, Chief Scientist at Posit and creator of the Tidyverse, shares his journey from a spreadsheet-loving teen to a major figure in data science. He reflects on how messy Excel sheets led to the tidy data revolution and discusses the challenges of developing R packages. The conversation touches on maintaining focus amidst burnout, his fascination with integrating LLMs into data workflows, and how writing books helps tidy ideas. Fun anecdotes, including a custom bike inspired by his Shiny textbook, add a personal touch to his insights.