
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Latest episodes

Jul 19, 2022 • 1h 22min
593: The Real-World Impact of Cross-Disciplinary Data Science Collaboration
Jon welcomes Professor Philip Bourne, Founding Dean of the School of Data Science at the University of Virginia to discuss his biomedical data science research, the importance of open-source and open-access within the industry and the data science skills you need to succeed today.In this episode you will learn:• Why Philip founded a School of Data Science [6:08]• How computing and data science have evolved across academic departments [15:55]• The improvements needed in higher education [26:44]• The most important data science skills for academia and industry and the 4+1 model [36:49]• Philip’s biomedical data science research and its fascinating practical applications [43:24]• The essential roles of open-source code and open-access publishing in data science [1:01:27]Additional materials: www.superdatascience.com/593

Jul 15, 2022 • 3min
592: How to Sell a Multimillion Dollar A.I. Contract
In this episode, Jon Krohn welcomes A.I. industry veteran Ben Taylor to discuss how to sell multimillion dollar A.I. contracts. Tune in to hear why trust and proof of value are some of the critical steps in his sales process.Additional materials: www.superdatascience.com/592

Jul 12, 2022 • 1h 15min
591: Simulations and Synthetic Data for Machine Learning
Mars Buttfield-Addison, PhD Candidate at the University of Tasmania, joins Jon Krohn for a high-energy episode covering everything from Machine Learning simulations to Swift, space junk, and more!In this episode you will learn:• What simulations and synthetic data are, and why they can be invaluable for real-life applications [5:47]• How simulated bots can solve any problem [9:07]• Practical uses of simulated data [21:49]• Why the mobile operating system language Swift is interesting for A.I. [25:46]• Why it's critical to track the amount of junk in space [35:47]• Whether programming or statistical skills are more important in data science [47:05]• What it’s like creating video games in a "secret" games lab [56:45]• Why you might want to do a data science internship in industry before pursuing in academia [ 1:01:54]Additional materials: www.superdatascience.com/591

Jul 8, 2022 • 6min
590: Artificial General Intelligence is Not Nigh (Part 2 of 2)
In this episode, Jon continues his two-part series on artificial general intelligence (AGI) and why we are unlikely to realize it anytime soon. Listen in as Jon reviews Meta's Yann LeCun's seven-part perspective on the topic.Additional materials: www.superdatascience.com/590

Jul 5, 2022 • 56min
589: Narrative A.I. with Hilary Mason
Hilary Mason, Co-Founder and CEO of Hidden Door, joins Jon Krohn for a live discussion that explores narrative A.I., emerging ML techniques, and how her OSEMN data science process developed.In this episode you will learn:
How narrative A.I. can assist creativity [5:14]
How to build ML products that have no quantitative error function to optimize [10:31]
How to ensure creative A.I. systems do not output non-sense or explicit content [16:58]
Hilary's OSEMN data science process [21:05]
The emerging ML technique she’s most excited about [24:58]
What it takes to be successful as CEO of an early-stage A.I. company [27:20]
What she looks for in engineering hires [32:28]
How she’s hopeful A.I. will transform our lives for the better in the decades to come [38:48]
Additional materials: www.superdatascience.com/589

Jul 1, 2022 • 6min
588: Artificial General Intelligence is Not Nigh
In this episode, Jon kicks off a two-part series that sees him explore the popular topic of artificial general intelligence and why it might–or might not–be only a few years away. Listen in as Jon explains the several reasons why he doesn't believe that AGI is nigh.Additional materials: www.superdatascience.com/588

Jun 28, 2022 • 1h 25min
587: Data Engineering for Data Scientists
Mark Freeman, Senior Data Scientist at Humu, joins Jon Krohn to talk about all things data engineering and offers listeners some critical tips for their data science career journey – from what it takes to get promoted to his number one tip for getting hired at a fast-growing capital-backed startup.In this episode you will learn:
How Humu leverages data and machine learning to improve workplace behaviors [10:38]
What is data engineering? [14:21]
What it takes to get promoted into more senior data science roles [20:55]
The differences between junior, senior, and staff data scientists [30:21]
Mark’s top tools for data extraction, modeling, and pipeline engineering [37:08]
Mark’s number one tip for getting hired at a fast-growing venture capital-backed startup [53:10]
Why all data scientists should be interested in Web3 [1:11:53]
Additional materials: www.superdatascience.com/587

Jun 24, 2022 • 5min
586: Daily Habit #10: Limit Social Media Use
In this episode, Jon dives into the popular topic of social media and its impact on his productivity. Tune in to hear how minimizing the use of social media can positively impact your days, mental health and work. Additional materials: www.superdatascience.com/586

Jun 21, 2022 • 1h 26min
585: PyMC for Bayesian Statistics in Python
In this episode, Dr. Thomas Wiecki, Core Developer of the PyMC Library and CEO of PyMC Labs, joins Jon for a masterclass in Bayesian statistics. Tune in to hear about PyMC, and discover why Bayesian statistics can be more powerful and interpretable than any other data modeling approach.In this episode you will learn:
What Bayesian statistics is [7:30]
Why Bayesian statistics can be more powerful and interpretable than any other data modeling approach [17:20]
How PyMC was developed [20:41]
Commercial applications of Bayesian stats [43:07]
How to build a successful company culture [1:03:14]
What Thomas looks for when hiring [1:11:13]
Thomas’s top resources for learning Bayesian stats yourself [1:13:57]
Additional materials: www.superdatascience.com/585

Jun 17, 2022 • 4min
584: OpenAI Codex
In this episode, Jon reviews the remarkable natural language model Codex by OpenAI. Learn why it has amassed a waitlist and how you can leverage its practical applications in your work. Additional materials: www.superdatascience.com/584