
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.
Latest episodes

Sep 17, 2024 • 1h 6min
819: PyTorch: From Zero to Hero, with Luka Anicin
Luka Anicin, CEO of Data Blues and a Udemy instructor with over 500,000 students, dives into the world of PyTorch, promoting its user-friendly approach for newcomers in AI. He discusses the importance of starting simple in machine learning projects, emphasizing data quality over complexity. Luka highlights the advantages of transfer learning for smaller datasets and the necessity of real-world projects to build a solid data science portfolio. Tune in for insights on increasing accuracy and efficiency using PyTorch!

Sep 13, 2024 • 30min
818: In Case You Missed It in August 2024
Shingai Manjengwa, a thought leader on the future of work, discusses the necessity of continuous education in a rapidly evolving job landscape. Daniel Hulme shares insights on structuring AI systems within businesses, while Jerry Yurchisin emphasizes the role of data engineering in a decentralized world. Nick Elprin elaborates on the advantages of decentralization for technology and business, highlighting the transformative power of AI agents in handling complex tasks. Together, they envision a future where learning and adaptability drive organizational success.

Sep 10, 2024 • 1h 36min
817: The Positron IDE, Tidy NLP and MLOps with Dr. Julia Silge
Dr. Julia Silge, Engineering Manager at Posit, discusses the innovative Positron IDE designed for data scientists. She shares the key differences between MLOps and classic NLP methods, emphasizing when each is most effective. Julia also reveals her top picks for LLMs that enhance coding productivity and highlights essential open-source libraries for MLOps. Her insights on the importance of tidy data and its relevance in R's tidyverse are particularly enlightening, making this conversation a must-listen for data professionals.

Sep 6, 2024 • 20min
816: Explaining AGI to a 94-Year-Old
In this touching conversation, Jon Krohn chats with his 94-year-old grandmother, Annie, who is diving into data science and AI. They break down what data is and the role of a data scientist in simple terms. Annie learns about the evolution of AI and the implications of artificial general intelligence. The discussion also highlights how automation could transform work, making it optional, enhancing leisure time with loved ones, and improving quality of life. It's a heartwarming example of bridging generations through knowledge.

Sep 3, 2024 • 1h 27min
815: Polars: Faster DataFrame Ops, with Marco Gorelli
In this enlightening discussion, Marco Gorelli, a Senior Software Engineer at Quansight Labs and a core developer of the Polars and Narwhals libraries, shares his insights on optimizing data operations. He explains when to use Polars over Pandas and its unique features like lazy evaluation and string optimizations. Marco also delves into the Narwhals library, bridging compatibility with Pandas. He shares his strategies for winning forecasting competitions and addresses the need for greater diversity in data science. Prepare for a deep dive into the future of data manipulation!

Aug 30, 2024 • 4min
814: Summer Reflections
As summer winds down, this episode shifts focus from the usual tech discussions to something more personal: reflecting on the importance of balancing work with life’s simple pleasures. While the world of data science and AI continues to evolve rapidly, it's essential to remember that true success isn't just about professional milestones. It’s also about cherishing the moments that make life meaningful. Tune in for a brief but impactful reflection on how to redefine success to include not just achievements, but also the everyday joys that often go unnoticed. Additional materials: www.superdatascience.com/814 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Aug 27, 2024 • 1h 44min
813: Solving Business Problems Optimally with Data, with Jerry Yurchisin
Jerry Yurchisin, a mathematical optimization expert from Gurobi, joins the conversation to explore the power of optimization in solving complex business challenges. He discusses engaging examples like the Burrito Optimization Game, explaining its real-world applications. Key differences between machine learning and optimization are highlighted, emphasizing how the latter can provide clear prescriptive solutions. Yurchisin also touches on the integration of large language models in optimization, advancements in GPU technology, and the complexities of NP-hard problems.

Aug 23, 2024 • 12min
812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery
In this enlightening discussion, representatives from Sakana AI and experts from the University of Oxford, University of British Columbia, and Vector Institute explore groundbreaking advancements in automated scientific discovery. They introduce the 'AI Scientist,' a revolutionary framework that could transform research processes. The conversation highlights both the immense potential and the ethical quandaries posed by AI in science. Additionally, they address safety concerns surrounding autonomous AI scientists while emphasizing the positive future impact of these technologies.

Aug 20, 2024 • 1h 14min
811: Scaling Data Science Teams Effectively, with Nick Elprin
Nick Elprin, a data science expert and co-founder of Domino Data Lab, shares his insights on scaling data science teams effectively. He discusses the importance of tailored AI solutions, emphasizing that there's no one-size-fits-all approach. The conversation covers when to integrate AI tools into businesses and the significance of community in navigating the complexities of generative AI. Elprin also reflects on his journey in launching a data science startup and the critical role of mathematics in achieving commercial success.

Aug 16, 2024 • 9min
810: The Five Levels of Self-Driving Cars
Jon Krohn, a knowledgeable expert in automation, dives into the intriguing world of self-driving cars. He explains the five levels of vehicle automation, from Level 0, where humans are in complete control, to Level 5, where cars drive themselves in any condition. With firsthand stories from his experiences in autonomous vehicles, Jon captures the excitement and implications of this technology. He breaks down the futuristic possibilities and the current state of advancements, making this a must-listen for tech enthusiasts and curious minds alike.