Super Data Science: ML & AI Podcast with Jon Krohn

Latest episodes

Apr 25, 2023 • 1h 12min

673: Taipy, the open-source Python application builder

Vincent Gosselin, CEO and co-founder of Taipy, an open-source Python library, joins Jon Krohn to discuss how to accelerate productivity in Python and build scalable, reusable, and maintainable data pipelines. Gosselin shares his breadth of wisdom honed over his decades-long AI career.

This episode is brought to you by Pathway, the reactive data processing framework, and by Posit, the open-source data science company. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• The Taipy library functionality [2:59]
• The future of data pipelines [21:40]
• Common trends of companies that are successful at adopting data pipelines [28:31]
• How no-code and low-code trends impact the data science lifecycle [33:00]
• How Vincent chose the programming languages that underpin Taipy [41:40]
• Common trends on how companies manage their data to learn from it [45:06]
• Vincent's perspective on AI winters [51:03]

Additional materials: www.superdatascience.com/673
Apr 21, 2023 • 17min

672: Open-source "ChatGPT": Alpaca, Vicuña, GPT4All-J, and Dolly 2.0

Get started with language models: Learn about the commercial-use options available for your business in this week’s Five-Minute Friday, where host Jon Krohn discusses four models that have many of the capabilities of ChatGPT and can run at a fraction of the cost.

Additional materials: www.superdatascience.com/672

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Apr 18, 2023 • 1h 3min

671: Cloud Machine Learning

Get to grips with AWS, Azure, and Google Cloud Platform on this week’s episode. Host Jon Krohn speaks with Kirill Eremenko and Hadelin de Ponteves about CloudWolf, a cloud computing educational platform that prepares students for certification in AWS (Amazon Web Services). Find out why an accreditation in cloud computing could be the safest investment for your data science career.

This episode is brought to you by Posit, the open-source data science company, and by AWS Inferentia. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• About CloudWolf [07:04]
• Why learning the cloud is important for data scientists [09:12]
• Is learning cloud computing complex? [22:30]
• Essential AWS services [28:31]
• Database options on AWS [33:47]
• How to run analytics on AWS [40:58]
• Why an AWS certification is so helpful [56:35]

Additional materials: www.superdatascience.com/671
Apr 14, 2023 • 13min

670: LLaMA: GPT-3 performance, 10x smaller

How does Meta AI's natural language model, LLaMA, compare to the rest? Based on the Chinchilla scaling laws, LLaMA is designed to be smaller but more performant. But how exactly does it achieve this feat? It's all done by training a small model for a longer period of time. Discover how LLaMA compares to its competition, including GPT-3, in this week's episode.

Additional materials: www.superdatascience.com/670

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Apr 11, 2023 • 1h 41min

669: Streaming, reactive, real-time machine learning

In this episode, Jon Krohn welcomes Adrian Kosowski, Co-Founder and Chief Product Officer at Pathway, who shares insights on streaming data processing and reactive data processing, and how they're shaping the future of machine learning. Tune in now for an unforgettable episode.

This episode is brought to you by Posit, the open-source data science company, and by AWS Inferentia. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• About Pathway's reactive data processing framework [04:45]
• Reactive data processing use cases [17:08]
• What is the difference between batch and streaming processing [33:18]
• Transformers in data engineering and data streaming [53:44]
• The benefits of Adrian's technical background as a CPO [1:04:17]
• Adrian's responsibilities and favorite tools as a CPO [1:15:25]
• Emerging ML approaches and tools for startups [1:28:49]

Additional materials: www.superdatascience.com/669
Apr 7, 2023 • 56min

668: GPT-4: Apocalyptic stepping stone?

AI risks, RLHF, and inner alignment: GPT stands to give the business world a major boost. But with everyone racing either to develop products that incorporate GPT or use it to carry out critical tasks, what dangers could lie ahead in working with a tool that applies essentially unknowable means (inner alignments) to reach its goals? This week’s guest Jérémie Harris speaks with Jon Krohn about the essential need for anyone working with GPT to understand the impact of a system comprising inner alignments that cannot – and may never – be fully understood.

Additional materials: www.superdatascience.com/668

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Apr 4, 2023 • 1h 5min

667: Harnessing GPT-4 for your Commercial Advantage

GPT-4, augmenting human tasks with AI, and using GPT-4 commercially: Vin Vashishta speaks to host Jon Krohn about how to leverage GPT-4 and outperform your competitors in both speed and value. Learn how GPT-4 has outmatched its predecessors – and many skilled workers – in this latest iteration of large language models.

This episode is brought to you by Pathway, the reactive data processing framework, by Posit, the open-source data science company, and by epic LinkedIn Learning instructor Keith McCormick. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• Using GPT-4 to screen for jobs [06:26]
• A framework for improving systems with GPT [13:32]
• Teaming, tooling and collaborating with GPT-4 [29:58]
• How to accelerate data science with generative A.I. [45:36]
• How to prepare for opportunities with GPT-4 [52:09]

Additional materials: www.superdatascience.com/667
Mar 31, 2023 • 12min

666: GPT-4

GPT-4 has landed! But how well does it compare to GPT-3.5? Tune in to hear Jon stack its performance against its predecessor: the results might just blow your mind.

Additional materials: www.superdatascience.com/666

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Mar 28, 2023 • 1h 28min

665: How to be both socially impactful and financially successful in your data career

Angel investor and data science consultant Josh Wills sits down with Jon Krohn to discuss his former roles (Google, Slack, and Cloudera) and the essential skills for engineering scalable machine learning projects.

This episode is brought to you by Pathway, the reactive data processing framework, and by epic LinkedIn Learning instructor Keith McCormick. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• Josh's 'Data Engineering for Machine Learning' course [06:50]
• Contextual bandits [10:52]
• Data quality and monitoring [16:45]
• The “infinite loop of sadness” in data product development [25:12]
• Josh’s definition of a data scientist [30:02]
• Josh's role at WeaveGrid [37:36]
• Management-Track vs Independent Contributor [48:47]
• Josh's work on the Covid pandemic [1:06:46]
• Josh’s favorite tech stack [1:11:13]

Additional materials: www.superdatascience.com/665
Mar 24, 2023 • 5min

664: MIT Study: ChatGPT Dramatically Increases Productivity

Can ChatGPT make us better and faster in our work, and is it the future or just another fad? In this episode, Jon Krohn delves into a new study from MIT about the tool’s potential productivity for white-collar tasks.

Additional materials: www.superdatascience.com/664

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
