

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode can be found at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes

Dec 15, 2022 • 49min
A Cloud Native Vector Database Management System
Frank Liu is Director of Operations & ML Architect at Zilliz, the company behind Milvus, an open source vector database. We discuss their recent VLDB paper (“A Cloud Native Vector Database Management System”) that describes recent updates to Milvus, as well as vector databases and vector search in general. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/ Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS. Detailed show notes can be found on The Data Exchange web site.

Dec 8, 2022 • 38min
What’s Next for Machine Learning in Time Series
Ira Cohen is co-founder and Chief Data Scientist at Anodot, a startup that uses time series tools to monitor business data in real time, so organizations can proactively resolve revenue, cost, and customer experience issues before they impact business performance. We recently wrote a well-received post that provided a detailed overview of the state of technologies for collecting, storing, and unlocking time series.

Dec 1, 2022 • 46min
Efficient Methods for Natural Language Processing
Roy Schwartz is Professor of Natural Language Processing at The Hebrew University of Jerusalem. We discussed a recent survey paper, co-written by Roy, that presents a broad overview of existing methods to improve NLP efficiency through the lens of traditional NLP pipelines.

Nov 23, 2022 • 30min
Responsible and Trustworthy AI
On this Thanksgiving holiday weekend in the U.S., we revisit a Twitter Spaces conversation I had with Andrew Burt, Managing Partner at BNH1, the first law firm focused on AI risks, and Bob Friday, Chief AI Officer at Juniper Networks.

Nov 17, 2022 • 38min
Building a premier industrial AI research and product group
Hung Bui is the CEO of VinAI, a premier Artificial Intelligence research-based company developing world-class products and services. Hung assembled the VinAI team just over three years ago and they are now among the Top 20 Global Companies in AI Research in 2022.

Nov 10, 2022 • 35min
An open source, production grade vector search engine
Bob van Luijt is CEO of SeMI Technologies, the company behind the popular vector search engine Weaviate. Bob describes Weaviate’s key features and core components and its popular use cases, and provides an overview of its near-term roadmap. We also discuss how vector search engines compare with existing data management systems.

Nov 3, 2022 • 35min
A comprehensive suite of open source tools for time series modeling
Federico Garza and Max Mergenthaler Canseco are both CTOs and co-founders of Nixtla, a startup building developer-friendly software that helps data scientists deploy predictive pipelines.

Oct 27, 2022 • 31min
Building Safe and Reliable AI applications
Christopher Nguyen is CEO and cofounder of Aitomatic, a startup that uses a knowledge-first approach to build and deploy machine learning solutions, with a focus on industrial applications (manufacturing and other physical settings). Join us at K1st World, a fantastic symposium and networking event slated for November 16 & 17. Use the discount code GRADIENTFLOW60 to attend in person or online.

Oct 20, 2022 • 42min
A new storage engine for vectors
Ram Sriharsha is VP of Engineering and R&D at Pinecone, a startup that offers a fully managed vector database (not just an index). We discuss Pinecone’s new proprietary storage engine, which was first described around the time we recorded this conversation.

Oct 13, 2022 • 42min
Project Lightspeed: Next-generation Spark Streaming
Karthik Ramasamy is the Head of Streaming at Databricks. He has extensive experience in streaming, having led teams at Twitter (Apache Heron), Splunk, and Streamlio (Apache Pulsar).