
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.
Latest episodes

Dec 6, 2022 • 54min
633: Responsible Decentralized Intelligence
This week's episode is all about Responsible Decentralized Intelligence as award-winning professor and tech entrepreneur, Dawn Song, joins Jon Krohn to help us explore this exciting topic in-depth.
This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• What is decentralized intelligence? [3:46]
• Dawn’s Responsible Data Economy collaboration with Meta AI [11:31]
• How homomorphic encryption, differential privacy, and multi-party computation can work together [16:22]
• How PrivateSQL makes differential privacy easy to use [22:54]
• The relationship between deep learning and federated learning [37:55]
• What is a responsible data economy? [42:13]
Additional materials: www.superdatascience.com/633

Dec 2, 2022 • 11min
632: Liquid Neural Networks
Liquid neural networks are a type of bio-inspired machine learning set to make a huge impact in the field of data analytics. On this week’s Five-Minute Friday, Jon Krohn speaks with Pathway.com Co-Founder Dr. Adrian Kosowski about the development of this new type of network and what this means for the future of data.
Additional materials: www.superdatascience.com/630
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Nov 29, 2022 • 59min
631: Data Analytics Career Orientation
Interview success, funny memes about data, and stakeholder management: Jon Krohn speaks with Luke Barousse, a full-time YouTuber who produces content to help aspiring data scientists. First, Jon and his guest go underwater to find out how data science can help you while working on a submarine before they emerge onto Luke’s YouTube channel. There, he discloses all the helpful hacks for data science beginners, with a generous helping of humor! As founder of MacroFit, a data-driven company that helps with meal planning, Luke is no stranger to portion sizes…
This episode is brought to you by Iterative (iterative.ai), your mission control center for machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Where Luke gets his inspiration for making YouTube videos [04:46]
• How Luke got into creating comedy skits [08:21]
• Luke’s favorite Python libraries for web scraping [14:41]
• Incorrect assumptions that aspiring data scientists make [15:54]
• The best time to use Power BI [19:15]
• The biggest mistakes Luke made in his data science career [22:17]
• Luke’s experience as a submariner and how it helped him in his data analyst career [38:13]
• The must-have skills for entry-level data analyst roles [43:46]
Additional materials: www.superdatascience.com/631

Nov 25, 2022 • 6min
630: Resilient Machine Learning
Jon Krohn sits down with Dr. Dan Shiebler at the Open Data Science Conference (ODSC) to dive into the critical components of building resilient machine learning.
Additional materials: www.superdatascience.com/630
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Nov 22, 2022 • 1h 11min
629: Software for Efficient Data Science
Has the term developer advocacy ever left you scratching your head? This week, data science developer advocate for JetBrains, Dr. Jodie Burchell, joins Jon Krohn to shed light on her responsibilities and why it's a role you might want to consider. Jodie also dives into building reproducible data science workflows and the keys to working effectively with real-world data.
This episode is brought to you by Iterative (iterative.ai), the open-source company behind DVC. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Jodie’s background in psychology [2:22]
• Jodie's tips for real-world data preparation [6:55]
• A tour of JetBrains' developer tools: PyCharm, DataSpell and Datalore [10:41]
• What is a data science developer advocate? [38:47]
• The books that Jodie has co-authored [46:18]
• Jodie's favorite Python libraries [58:33]
• How to have reproducible data science workflows [1:01:36]
Additional materials: www.superdatascience.com/629

Nov 18, 2022 • 5min
628: The Critical Human Element of Successful A.I. Deployments
On this episode of Five-Minute Friday, Jon Krohn speaks from the Open Data Science Conference (ODSC). There, he sits down with author and data scientist Keith McCormick to discuss the conference’s key trend: learning the importance of trust in the relationship between humans and algorithms.
Additional materials: www.superdatascience.com/628
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Nov 15, 2022 • 1h 31min
627: AutoML: Automated Machine Learning
Jon Krohn speaks with Erin LeDell, H2O.ai’s Chief Machine Learning Scientist. They investigate how AutoML supercharges the data science process, the importance of admissible machine learning for an equitable data-driven future, and what Erin’s group Women in Machine Learning & Data Science is doing to increase inclusivity and representation in the field.
This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• The H2O AutoML platform Erin developed [07:43]
• How genetic algorithms work [19:17]
• Why you should consider using AutoML [28:15]
• The “No Free Lunch Theorem” [33:45]
• What Admissible Machine Learning is [37:59]
• What motivated Erin to found R-Ladies Global and Women in Machine Learning and Data Science [47:00]
• How to address bias in datasets [57:03]
Additional materials: www.superdatascience.com/627

Nov 11, 2022 • 7min
626: Subword Tokenization with Byte-Pair Encoding
Word tokenization, character tokenization and subword tokenization go head-to-head this week as Jon Krohn delivers a mini-bootcamp on the NLP-related process.
Additional materials: www.superdatascience.com/626
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Nov 8, 2022 • 1h 4min
625: Analyzing Blockchain Data and Cryptocurrencies
Chainalysis' Director of Research, Kim Grauer, joins Jon Krohn to explore the state of economic-data analysis on the blockchain.
This episode is brought to you by Datalore (datalore.online/SDS), the collaborative data science platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Kim's role as Director of Research [5:02]
• The unique real-time economic-data analytics of the blockchain [13:07]
• How ML can predict patterns of criminal activity on the blockchain [18:56]
• Interesting use cases of ML for crime investigation [29:37]
• The tools and approaches Kim uses daily [47:44]
• The future of crypto, blockchains, and data science [50:54]
• Why a data science bootcamp helps people break into data science [53:42]
Additional materials: www.superdatascience.com/625

Nov 4, 2022 • 7min
624: Imagen Video: Incredible Text-to-Video Generation
On this week’s Five-Minute Friday, Jon Krohn investigates Imagen Video, Google’s latest model for making video art out of text prompts. Recently published, this text-to-video model joins a field that already includes strong text-to-image models like DALL-E 2. Unlike DALL-E 2, it returns moving images or time-based media. Tune in to hear Jon explain the technology that made Imagen Video the tech giant’s shiniest new tool to date.
Additional materials: www.superdatascience.com/624
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.