

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes
Mentioned books

May 6, 2025 • 1h 15min
885: Python Polars: The Definitive Guide, with Jeroen Janssens and Thijs Nieuwdorp
Jeroen Janssens and Thijs Nieuwdorp are data frame library Polars’ greatest advocates in this episode with Jon Krohn, where they discuss their book, Python Polars: The Definitive Guide, best practice for using Polars, why Pandas users are switching to Polars for data frame operations in Python, and how the library reduces memory usage and compute time up to 10x more than Pandas. Listen to the episode to be a part of an O’Reilly giveaway!
Additional materials: www.superdatascience.com/885
This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA.
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(07:44) Why Jeroen and Thijs wrote Python Polars: The Definitive Guide
(21:54) Best practices in Polars
(25:55) Why Polars has so many users
(34:32) The benefits of the Great Tables package
(51:06) Jeroen and Thijs’ partnership with NVIDIA and Dell for Python Polars: The Definitive Guide

May 2, 2025 • 7min
884: Model Context Protocol (MCP) and Why Everyone’s Talking About It
Model Context Protocol (MCP) is Anthropic’s hottest tool, with over 1,000 community-built MCP servers in operation by February alone. In this Five-Minute Friday, Jon Krohn explains what took so long for users to catch on: Anthropic released MCP in November 2024. Hear more about the buzz behind MCP, its applications, and how easy it is to get started.
Additional materials: www.superdatascience.com/884
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Apr 29, 2025 • 1h 4min
883: Blackwell GPUs Are Now Available at Your Desk, with Sama Bali and Logan Lawler
Returning after the “Super Bowl of AI”, NVIDIA GTC, Sama Bali and Logan Lawler talk to Jon Krohn about their respective work at tech giants NVIDIA and Dell. Sama and Logan discuss the next-gen Blackwell GPUs to their collaboration with Dell in launching Pro-Max PCs specially designed to take on heavy computational workloads as well as the incredible performance of GB 10 and GB 300 workstations, and the widening accessibility of AI developer tools and models.
Additional materials: www.superdatascience.com/883
This episode is brought to you by ODSC, the Open Data Science Conference, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA.
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(07:29) About Dell’s Pro Max PCs
(14:01) Why having a Blackwell GPU from Nvidia is a great option for those new to training and deploying AI models
(36:47) When it makes sense for a data scientist to switch from a Unix to a Windows based system
(46:33) Logan’s and Sama’s predictions for AI

Apr 25, 2025 • 10min
882: 40x Hotter Than the Sun: The ASML Machines That Make AI Chips
This week’s five-minute Friday heads to the Netherlands to find out more about Dutch company ASML, the brains behind the lithography machines that build AI chips. Jon Krohn walks through how ASML came to dominate the market, where they’re headed next, and how ASML’s complex machines shape AI chips as well as the very future of AI. Additional materials: www.superdatascience.com/882Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Apr 22, 2025 • 1h 17min
881: Beyond GPUs: The Power of Custom AI Accelerators, with Emily Webber
Emily Webber speaks to Jon Krohn about her work at Amazon Web Services, from its Annapurna Labs-developed Nitro System, a foundational technology that can enhance securities and performance in the cloud and how Trainium2 became AWS’ most powerful AI chip with four times the compute of Trainium. Hear the specs of AWS’s chips and when to use them.Additional materials: www.superdatascience.com/881This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
(08:36) Emily’s work on AWS’ SageMaker and Trainium
(23:54) How AWS Neuron lets builders tailor their approach to using frameworks
(29:07) Why using an accelerator is better than using a GPU
(35:29) The key differences between AWS Trainium and AWS Trainium2
(52:45) How to select between AWS Trainium and AWS Trainium2

Apr 18, 2025 • 10min
880: Manus, DeepSeek and China’s AI Boom
First developed in China, Manus AI and DeepSeek have made great waves on an international scale. Sought-after for their cost-effectiveness compared to US-made tech, Manus AI and DeepSeek are quickly becoming dominant technologies inside the country. In this five-minute Friday, Jon Krohn asks: Do these technologies warrant the huge amount of resources spent on them by multiple industries in China, and what makes hype become a mainstay?Additional materials: www.superdatascience.com/880Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Apr 15, 2025 • 1h 7min
879: Serverless, Parallel, and AI-Assisted: The Future of Data Science is Here, with Zerve’s Dr. Greg Michaelson
Greg Michaelson speaks to Jon Krohn about the latest developments at Zerve, an operating system for developing and delivering data and AI products, including a revolutionary feature allowing users to run multiple parts of a program’s code at once and without extra costs. You’ll also hear why LLMs might spell trouble for SaaS companies, Greg’s ‘good-cop, bad-cop’ routine that improves LLM responses, and how RAG (retrieval-augmented generation) can be deployed to create even more powerful AI applications.Additional materials: www.superdatascience.com/879This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA.Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
(04:00) Zerve’s latest features
(35:26) How Zerve’s built-in API builder and GPU manager lowers barriers to entry
(40:54) How to get started with Zerve
(41:49) Will LLMs make SaaS companies redundant?
(52:29) How to create fairer and more transparent AI systems
(56:07) The future of software developer workflows

Apr 11, 2025 • 31min
878: In Case You Missed It in March 2025
AI stacks, AGI, training neural networks, and AI authenticity: Jon Krohn rounds up his interviews from March with this episode of “In Case You Missed It”. In his favorite clips from the month, he speaks to Andriy Burkov (Episode 867), Natalie Monbiot (Episode 873), Richmond Alake (Episode 871) and Varun Godbole (Episode 869). Additional materials: www.superdatascience.com/878Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Apr 8, 2025 • 1h 10min
877: The Neural Processing Units Bringing AI to PCs, with Shirish Gupta
NPUs, AIPC, and Dell’s growing suite of AI products: Shirish Gupta speaks to Jon Krohn about neural processing units and what makes them a go-to tool for AI inference workloads, reasons to move your workloads from the cloud and to your local devices, what the mnemonic AIPC stands for and why it will soon be on everyone’s lips, and he offers a special intro to Dell’s new Pro-AI Studio Toolkit. Hear about several real-world AIPC applications run by Dell’s clients, from detecting manufacturing defects to improving efficiencies for first responders, massively supporting actual life-or-death situations. Additional materials: www.superdatascience.com/877This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
(03:28) What neural processing units (NPUs) are
(23:53) About Dell Pro AI Studio
(35:03) Use cases for Dell Pro AI Studio
(45:16) How AI development workflows and applications will change
(49:01) About Dell’s AI factory ecosystem

10 snips
Apr 4, 2025 • 15min
876: Hugging Face’s smolagents: Agentic AI in Python Made Easy
Discover how Hugging Face's smolagents are revolutionizing the landscape of agentic AI with their simple yet powerful Python library. Learn about their user-friendly features and multi-step reasoning capabilities that can transform how we develop autonomous AI agents. The discussion also highlights various frameworks, including a comparison with LangChain and Microsoft Autogen, encouraging engagement and feedback from listeners. This is a deep dive into making AI more accessible and efficient for personal and professional use!