

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes
Mentioned books

May 24, 2024 • 27min
786: The Six Keys to Data Scientists' Success, with Kirill Eremenko
Learn about the six keys to data science success as host Jon Krohn welcomes back Kirill Eremenko, the mastermind behind SuperDataScience. Kirill shares his top insights on data science careers, from building strong portfolios to leveraging mentors and hands-on labs. With over 2.7 million students, his advice is a must-hear for aspiring and experienced data scientists alike.Additional materials: www.superdatascience.com/786Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

May 21, 2024 • 1h 6min
785: Math, Quantum ML and Language Embeddings, with Dr. Luis Serrano
Dr. Luis Serrano from the Serrano Academy reveals how to make Math and Quantum ML accessible, tackles the challenges of teaching A.I. to beginners, and explores the power of embeddings in enterprise applications. Explore the future of Quantum Machine Learning and the latest trends in AI, including multimodality and autonomous systems.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How math and AI can be made easy to understand [05:21]• The three major categories of learners [16:21]• Why embeddings are the most important component of LLMs [26:19]• How semantic search differs from a traditional keyword search [29:57]• The most exciting emerging application areas for AI [42:41]• The promising application areas for Quantum Machine Learning [49:18]Additional materials: www.superdatascience.com/785

May 17, 2024 • 10min
784: Aligning Large Language Models, with Sinan Ozdemir
Aligning LLMs: How can we teach pre-trained LLMs to hold a conversation and learn new information from each other? This was where Sinan Ozdemir began his investigation into aligning LLMs. In this episode, he talks to Jon Krohn about the limitations of definitions for LLMs, training LLMs, and whether it is possible to train an LLM without alignment.Additional materials: www.superdatascience.com/784Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

May 14, 2024 • 1h 6min
783: Generative A.I. for Solar Power Installation, with Navdeep Martin
Recent advances in GenAI, how to tackle the climate crisis with advanced technology, and addressing the knowledge gap in understanding AI: Jon Krohn speaks to Flypower co-founder and CEO Navdeep Martin about the advances made in GenAI, from products to applications, and how we might use AI to tackle climate change.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How the Washington Post’s recommendation systems work [03:29]• Why product leaders make great CEOs [10:36]• How Flypower uses GenAI to tackle climate change [22:13]• How Flypower identifies its customers’ most pertinent questions [30:03]• How AI might come to tackle climate change [36:52]• How to mitigate hallucination in AI models [41:04]Additional materials: www.superdatascience.com/783

May 10, 2024 • 41min
782: In Case You Missed It in April 2024
Hear Jon Krohn’s favorite five clips from his April interviews. Chief Scientist at Posit PBC Hadley Wickham on the subtle differences between Python and R. Professor of Business Analytics Barrett Thomas walks through the variables that companies should consider when using drones or any other tech to improve their business operations and bottom line. Aleksa Gordić, Founder of Runa AI believes an overhaul of the current educational system is long overdue. Bernard Marr discusses the future of GenAI and its impact on the world of work. And SuperDataScience founder Kirill Eremenko gives a lively workshop on gradient boosting. Additional materials: www.superdatascience.com/782Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

May 7, 2024 • 1h 5min
781: Ensuring Successful Enterprise AI Deployments, with Sol Rashidi
Sol Rashidi, a distinguished data executive who has served in C-suite roles at Fortune 100 companies, joins Jon Krohn to delve into successful enterprise AI strategies and the reasons behind the high turnover among Chief Data Officers. This episode provides an in-depth look at selecting AI projects that succeed and understanding the strategic value of patents in various industries. Benefit from Sol’s extensive experience and practical advice on navigating complex corporate challenges.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How CDOs and related roles have such high turnover because [09:40]• The importance of building relationships in AI projects [17:01]• How Sol's book "The AI Survival Guide" came about [20:44]• How high-criticality, low-complexity AI projects are the ones with the highest probability of success [27:11]• How Enterprise data security issues can be resolved with technologies like Protopia’s stained-glass data-masking solution [36:10]• Why having great data engineers is essential [47:57]• The value of patents [51:45]Additional materials: www.superdatascience.com/781

May 3, 2024 • 8min
780: How to Become a Data Scientist, with Dr. Adam Ross Nelson
Want to become a data scientist? Jon and Adam discuss the key steps to becoming a data scientist, with a focus on developing portfolio projects. Hear about the 10 project ideas Adam recommends in his book to help you stand out in the data science community.Additional materials: www.superdatascience.com/780Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

Apr 30, 2024 • 1h 28min
779: The Tidyverse of Essential R Libraries and their Python Analogues, with Dr. Hadley Wickham
Tidyverse, ggplot2, and the secret to a tech company’s longevity: Hadley Wickham talks to Jon Krohn about Posit’s rebrand, Tidyverse and why it needs to be in every data scientist’s toolkit, and why getting your hands dirty with open-source projects can be so lucrative for your career.This episode is brought to you by Intel and HPE Ezmeral Software. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• All about the Tidyverse [04:46]• Hadley’s favorite R libraries [17:10]• The goal of Posit [30:29]• On bringing multiple programming languages together [36:02]• The principles for a long-lasting tech company [52:10]• How Hadley developed ggplot2 [55:24]• How to contribute to the open-source community [1:05:43]Additional materials: www.superdatascience.com/779

Apr 26, 2024 • 7min
778: Mixtral 8x22B: SOTA Open-Source LLM Capabilities at a Fraction of the Compute
Mixtral 8x22B is the focus on this week's Five-Minute Friday. Jon Krohn examines how this model from French AI startup Mistral leverages its mixture-of-experts architecture to redefine efficiency and specialization in AI-powered tasks. Tune in to learn about its performance benchmarks and the transformative potential of its open-source license.Additional materials: www.superdatascience.com/778Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

Apr 23, 2024 • 1h 9min
777: Generative AI in Practice, with Bernard Marr
Generative AI is reshaping our world, and Bernard Marr, world-renowned futurist and best-selling author, joins Jon Krohn to guide us through this transformation. In this episode, Bernard shares his insights on how AI is transforming industries, revolutionizing daily life, and addressing global challenges. With his extensive experience advising top organizations worldwide, he also examines the ethical considerations of AI deployment.This episode is brought to you by Intel and HPE Ezmeral Software. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How Generative AI will transform industries [03:55]• The evolution of Generative AI [10:19]• How will Generative AI impact daily life [16:52]• The ethical challenges of AI [18:55]• How corporations can harness Generative AI for collaboration [24:36]• Industries that will be impacted by Generative AI [32:20]• How Sora-like Generative AI systems will create highly immersive entertainment [42:16]• How Generative AI could unlock 99% of business data [53:34]Additional materials: www.superdatascience.com/777