

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes
Mentioned books

Jul 28, 2022 • 36min
Data Infrastructure for Computer Vision
Danny Bickson and Amir Alush are the creators of fastdup, a very impressive free tool for surfacing duplicates, anomalies, and leakage in visual data. In line with its name, it’s fast: fastdup is written in C++ and can handle millions of images easily. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

Jul 21, 2022 • 38min
How DALL·E works
Mark Chen is a Research Scientist at OpenAI and part of the team behind DALL·E 2, a new AI system that can create realistic images and art based on natural language descriptions. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

Jul 14, 2022 • 47min
Scalable, end-to-end machine learning, for everyone
Jules Damji is lead developer advocate, and Richard Liaw is an engineering manager at Anyscale, the startup founded by the creators of Ray, the open source project that makes it simple to scale any compute-intensive Python workload. To learn more about Ray and how to scale machine learning applications, attend the Ray Summit (San Francisco / Aug 23-24) https://www.anyscale.com/ray-summit-2022?utm_source=gradientflow&utm_medium=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site

Jul 7, 2022 • 44min
Orchestration and Pipelines for Data Scientists
Rick Lamers is co-Founder and CEO at Orchest, the startup behind an open source project that enables data scientists to create, manage, and execute complex end-to-end data pipelines. Download the FREE Report: State of Workflow Orchestration → https://gradientflow.com/2022-workflow-orchestration-survey/?utm_source=gradientflow&utm_medium=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site

Jun 30, 2022 • 37min
Dataframes at scale
Devin Petersohn is CTO and co-founder of Ponder, and the creator of Modin, a fast, scalable, drop-in replacement for the popular Pandas library. Download the FREE Report: State of Workflow Orchestration → https://gradientflow.com/2022-workflow-orchestration-survey/?utm_source=gradientflow&utm_medium=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site

Jun 23, 2022 • 41min
Software-Defined Assets
Nick Schrock is founder and Elementl, the startup behind Dagster, a popular open source, data orchestration platform. We discussed recent trends in data engineering and infrastructure, and Dagster’s introduction of software-defined assets, a new approach to managing, maintaining, and orchestrating data declaratively.Download the FREE Report: State of Workflow Orchestration → https://gradientflow.com/2022-workflow-orchestration-survey/?utm_source=gradientflow&utm_medium=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site

Jun 16, 2022 • 47min
Adversarial Machine Learning
Edmon Begoli, leads the AI Systems R&D section at Oak Ridge National Laboratory (ORNL), where he is also a distinguished member of the ORNL research staff. Our conversation centered on his upcoming presentation at the Data+AI Summit, where he will describe the four principal categories of Adversarial AI and their future implications.Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

Jun 9, 2022 • 47min
Orchestrating Machine Learning Applications
Haytham Abuelfutuh is co-founder and CTO of Union, a startup founded by the team behind Flyte, a popular open source project originated by Lyft. Flyte is a workflow automation platform used for many different applications, but especially as an orchestrator for machine learning applications.Download the FREE Report: State of Workflow Orchestration → https://www.prefect.io/lp/gradientflow?utm_source=gradientflow&utm_medium=newsletterSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

Jun 2, 2022 • 41min
Narrative AI
This week’s guest is Hilary Mason, co-founder of Hidden Door, a startup that uses AI and machine learning to help create and power role-playing games (RPG). Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

May 26, 2022 • 40min
Machine Learning Model Observability
Oren Razon is CEO and co-founder of Superwise, a startup that builds tools to streamline observability for machine learning models. This episode provides a comprehensive overview of tools and best practices for deploying, monitoring, and managing machine learning models in production.Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcastSubscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.