

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes
Mentioned books

Sep 9, 2025 • 1h 12min
921: NPUs vs GPUs vs CPUs for Local AI Workloads, with Dell’s Ish Shah and Shirish Gupta
Using Windows for AI development and the bleeding edge of NPUs: Shirish Gupta and Ish Shah from Dell Technologies speak to Jon Krohn about the latest products from Dell, the future of neural-processing units (NPUs), and how AI developers can make sound hardware investments.
This episode is brought to you by the Trainium2, the latest AI chip from AWS, by ODSC, the Open Data Science Conference and by Gurobi.
Additional materials: www.superdatascience.com/921
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(04:18) Why Windows still outranks other operating systems
(20:58) The difference between GPUs and NPUs
(32:44) How to access and use Dell’s NPUs and GPUs
(49:08) Using processing units on the cloud versus locally
(57:43) About the Dell Pro Max

Sep 5, 2025 • 22min
920: In Case You Missed It in August 2025
This month’s episode of In Case You Missed It gives us reasons to be cautiously optimistic about the future of large language models (LLMs), with guests discussing what to do about recent reports that found AI agents blackmailed human users when threatened, the importance of post-training LLMs, and the training we have available for data and AI engineers to create robust, secure, and useful AI. Jon Krohn includes clips from his interviews with Akshay Agrawal (Episode 911), Julien Launay (Episode 913), Michelle Yi (Episode 915), and Kirill Eremenko (Episode 917).
Additional materials: www.superdatascience.com/920
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Sep 2, 2025 • 1h 30min
919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron
PyTorch, AGI, and the future of alignment research: Aurélien Géron joins Jon Krohn in this live interview to talk about the fourth edition of his bestselling Hands-On Machine Learning as well as what superintelligence makes him hopeful for, as well as what concerns him about machines surpassing human intelligence.
This episode is brought to you by Gurobi and by the Dell AI Factory with NVIDIA
Additional materials: www.superdatascience.com/919
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(02:04) Why Aurélien wrote Hands-On Machine Learning
(20:54) How Aurélien came to decide on material for the new edition
(28:53) Aurélien’s predictions for AGI
(51:21) How to support alignment research
(1:13:42) Does superintelligence mean super-capability

Aug 29, 2025 • 9min
918: Multi-Agent Systems with CrewAI
In this Five-Minute Friday, Jon Krohn introduces listeners to CrewAI, an open-source Python framework that can create and manage multi-agent teams. The clue is in the title: CrewAI assembles specialized agents into single “crews” that achieve complex goals between them. CrewAI’s agent teams can also learn and iterate, meaning that after the crew has achieved its goals for the first time, they can refine and tailor their approach to future goals.
Additional materials: www.superdatascience.com/918
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Aug 26, 2025 • 1h 16min
917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko
Founder of SuperDataScience, Kirill Eremenko, talks to Jon Krohn about how he found the best tools and approaches to help launch his 8-week AI engineering bootcamp. He breaks down the topics participants cover each week, and he also shares his tips with listeners who might want to start their own tech bootcamp or sign up for SuperDataScience’s September 2025 cohort.
This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference
Additional materials: www.superdatascience.com/917
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(10:58) Weeks 1-4 of the SuperDataScience bootcamp
(37:52) How to use AI to drive the bottom line in business
(47:50) Weeks 5-8 of the SuperDataScience bootcamp
(54:50) How to convert LLMs to agents
(1:09:33) Jon’s feedback on the SuperDataSciencebootcamp

Aug 22, 2025 • 10min
916: The 5 Key GPT-5 Takeaways
GPT-5 has just been released, but with not very much fanfare. In this Five-Minute Friday, Jon Krohn asks if GPT-5 deserves the community’s underwhelmed response to its release. He outlines five features of the model and explains why people might be feeling less than enthusiastic in the broader context of LLM development. Which LLMs are leading the way, and which are still playing the game of catch-up?
Additional materials: www.superdatascience.com/916
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Aug 19, 2025 • 1h 10min
915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi
Tech leader, investor, and Generationship cofounder Michelle Yi talks to Jon Krohn about finding ways to trust and secure AI systems, the methods that hackers use to jailbreak code, and what users can do to build their own trustworthy AI systems. Learn all about “red teaming” and how tech teams can handle other key technical terms like data poisoning, prompt stealing, jailbreaking and slop squatting.
This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA.
Additional materials: www.superdatascience.com/915
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(03:31) What “trustworthy AI” means
(31:15) How to build trustworthy AI systems
(46:55) About Michelle’s “sorry bench”
(48:13) How LLMs help construct causal graphs
(51:45) About Generationship

Aug 15, 2025 • 26min
914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz
In this Five-Minute Friday, Cofounder and CTO of lakeFS Oz Katz talks to Jon Krohn about data warehouses, data lakes, and how companies can handle increasingly complex data infrastructures and formats. Hear about lakeFS’s collaboration with Legofest, lakeFS’s approach to helping users collaborate on data lakes, and how to overcome the challenges of working with multimodal data.
Additional materials: www.superdatascience.com/914
This episode is brought to you by the Dell AI Factory with NVIDIA.

Aug 12, 2025 • 1h 15min
913: LLM Pre-Training and Post-Training 101, with Julien Launay
Julien Launay launched Adaptive to give data science teams in business enterprises their “RLOps tooling” to make reinforcement learning easier. Talking to Jon Krohn, Julien says, “Most of our users are data scientists who write Python codes to interface with the system”. Adaptive is also able to work with companies without data science teams, collaborating with partners like Deloitte to add the necessary personnel. Julien is currently working on making his platform more widely available.
Additional materials: www.superdatascience.com/913
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Aug 8, 2025 • 33min
912: In Case You Missed It in July 2025
In this episode of In Case You Missed It, we look back on five great interview episodes from July. Hear from Lilith Bat-Leah (Episode 901), Sinan Ozdemir (Episode 903), Sebastian Gehrmann (Episode 905), Zohar Bronfman (Episode 907) and Robert Ness (Episode 909). They’ll tell you why data-centric machine learning is so important across disciplines, starting with law, and how we can use AI benchmarks and “red teaming” to refine our search for the best AI models.
Additional materials: www.superdatascience.com/912
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.