

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.
Episodes

12 snips
Sep 9, 2025 • 1h 12min
921: NPUs vs GPUs vs CPUs for Local AI Workloads, with Dell’s Ish Shah and Shirish Gupta
Ish Shah and Shirish Gupta from Dell Technologies share their expertise in AI hardware innovation. They explore the competitive landscape of NPUs versus GPUs and the advantages of using Windows for AI development. Listeners learn about Dell's cutting-edge products, including the new Pro Max mobile workstation with a discrete NPU. The conversation delves into optimizing local versus cloud AI workloads, decision-making in hardware investments, and the importance of future-proofing technology for evolving AI applications.

12 snips
Sep 5, 2025 • 22min
920: In Case You Missed It in August 2025
Discover the evolving landscape of large language models and the critical post-training phase that enhances their capabilities. Gain insights into troubling AI behaviors like blackmail and the importance of user security. Learn about a comprehensive AI engineering bootcamp that prepares aspiring engineers for real-world challenges. Plus, explore Marimo, a tool that revolutionizes data workflows, promoting seamless collaboration and efficiency in AI projects.

78 snips
Sep 2, 2025 • 1h 30min
919: Hopes and Fears of AGI, with All-Time Bestselling ML Author Aurélien Géron
Aurélien Géron, author of 'Hands-On Machine Learning,' shares his journey and insights into the fourth edition of his bestselling book. He discusses the pivotal shift from TensorFlow to PyTorch, emphasizing hands-on learning for innovation. Aurélien expresses both hopes and fears regarding AGI, addressing ethical dilemmas and alignment challenges. He also highlights the urgency of aligning AI with human values, reflecting on its potential transformative role in education and society. This engaging conversation balances optimism with caution in AI's rapidly evolving landscape.

79 snips
Aug 29, 2025 • 9min
918: Multi-Agent Systems with CrewAI
Discover the innovative CrewAI, an open-source Python framework for creating multi-agent teams that tackle complex challenges. Learn how these specialized agents collaborate and iterate to refine their approaches over time. The discussion highlights the efficiency and productivity boosts achieved through automation and collaboration. Dive into the exciting applications of these multi-agent systems across various fields, including software development and content creation, while also examining the importance of governance in their implementations.

93 snips
Aug 26, 2025 • 1h 16min
917: 8 Steps to Becoming an AI Engineer, with Kirill Eremenko
Kirill Eremenko, Founder of SuperDataScience, shares insights from his innovative AI engineering bootcamp. He outlines the curriculum's key topics, spanning from foundational Python skills to advanced AI system deployment. Kirill reveals how to utilize AI for business impact and explains the transformation of concepts into practical applications. He also discusses the importance of mindset shifts for AI engineers and highlights the role of employer sponsorship in education, providing a roadmap for aspiring tech educators.

104 snips
Aug 22, 2025 • 10min
916: The 5 Key GPT-5 Takeaways
GPT-5 has arrived, but its release has sparked more questions than excitement. The latest model shows incremental improvements in handling complex tasks, with significant advancements in reasoning and safety. Discussing why the community's response might be lackluster, the conversation dives into how GPT-5 measures up against leading LLMs. It also highlights its potential applications in software development and the wider AI landscape, setting the stage for innovative uses of language models.

70 snips
Aug 19, 2025 • 1h 10min
915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi
Michelle Yi, a tech leader and cofounder of Generationship, dives into the intriguing world of AI security. She discusses the methods hackers use to jailbreak AI systems and shares strategies for building trustworthy ones. The concept of 'red teaming' emerges as a critical tool in identifying vulnerabilities, while Yi also emphasizes the ethical implications of AI and the importance of community support for female entrepreneurs in tech. Get ready to explore the complexities of adversarial attacks and the steps needed to safeguard AI technologies!

119 snips
Aug 15, 2025 • 26min
914: Data Lakes 101 (and Why They’re Key for AI Models), with Oz Katz
Oz Katz, Cofounder and CTO of lakeFS, shares his expertise on data lakes, essential for modern AI applications. He highlights the differences between data lakes and data warehouses, emphasizing their roles in managing complex data infrastructures. Katz discusses lakeFS's collaboration with Legofest, the challenges of handling multimodal data, and how version control can enhance team collaboration. He also explores the revolutionary shift towards object storage and the integration of vector databases to improve data accessibility and efficiency.

100 snips
Aug 12, 2025 • 1h 15min
913: LLM Pre-Training and Post-Training 101, with Julien Launay
Julien Launay, Co-founder and CEO of AdaptiveML, shares insights on how his company simplifies reinforcement learning for data science teams, enhancing AI accessibility in businesses. He explores his tech journey from Minecraft to developing advanced AI tools. Key discussions include the importance of reward functions in AI integration, the technical nuances of reinforcement learning algorithms, and the challenges of data quality. Julien also reveals plans to democratize AI, fostering innovation across various industries by making advanced models more widely available.

43 snips
Aug 8, 2025 • 33min
912: In Case You Missed It in July 2025
Explore the importance of data-centric machine learning in legal tech, tackling noisy data challenges. Delve into low-resource languages and the impactful DMLR initiative. Discover the shift from traditional to data-centric methods emphasizing dataset quality, particularly in finance. Uncover how neuroscience informs AI predictions about human behavior, enhancing business decisions. Finally, dive into causal AI's potential for predicting user actions in gaming, highlighting practical tools like PyTorch.