

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode are available at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes

Aug 21, 2025 • 47min
The Fenic Approach to Production-Ready Data Processing
Kostas Pardalis, co-founder of Typedef, introduces Fenic, an open-source framework for AI-driven data processing. He explains how Fenic treats inference as a key operation, and how that changes unstructured data management and the optimization of data pipelines. The conversation covers the evolution of data technologies, the challenges of integrating structured and unstructured data, and the potential of large language models. Kostas also discusses the importance of open file formats and practical use cases in cybersecurity and healthcare that improve operational efficiency.

Aug 16, 2025 • 28min
When AI Eats the Bottom Rung of the Career Ladder
Evangelos Simoudis from Synapse Partners shares insights on critical trends in the AI industry. He discusses the 'Great Hollowing Out,' in which automation is wiping out entry-level jobs and making it harder for fresh graduates to get started. The conversation covers the economic implications of AI hardware depreciation and changing team dynamics that blend experienced and novice talent. Simoudis also explores how AI business models are evolving, particularly in light of GPT-5, and how that is reshaping the way companies engage with AI services.

Aug 14, 2025 • 33min
From NotebookLM to Audio Companions: Why Google’s AI Team Went Startup
Raiza Martin, co-founder of Huxe and former leader of Google's NotebookLM team, shares insights on her exciting transition from a tech giant to startup life. She discusses the innovative potential of audio-first personal AI companions, highlighting how they can enhance daily interactions. The conversation dives into the evolution of AI models and the emotional connections users form with chatbots. Raiza also addresses the challenges of deploying AI and the importance of privacy, showcasing a future where technology feels increasingly personal.

Aug 7, 2025 • 42min
The AI-Native Notebook That Thinks Like a Spreadsheet
In this engaging discussion, Akshay Agrawal, founder and CEO of Marimo and former TensorFlow engineer, dives into the revolutionary features of Marimo, an open-source reactive notebook aimed at enhancing Python coding. He explores how AI integration provides runtime context for improved code generation, facilitating diverse applications from cybersecurity to DevOps. Akshay also highlights the unique capabilities of Marimo compared to traditional notebooks and the importance of live coding in data analysis, all while addressing the evolving landscape of user needs in data platforms.

Jul 31, 2025 • 40min
How Agentic AI is Transforming Wall Street
Josh Pantony, CEO of Boosted AI, discusses the groundbreaking possibilities of agentic AI in finance. He highlights Alpha, a platform that turns AI into proactive workers for finance professionals, significantly enhancing productivity. The conversation delves into the challenges of maintaining auditability and the importance of balancing speed with quality in AI responses. Pantony also emphasizes the shift toward multimedia consumption of financial information among younger generations, showcasing how AI can revolutionize data analysis and market insights.

Jul 24, 2025 • 46min
The Quantum Advantage Is Real—But Where's the Infrastructure?
Jennifer Prendki, a former DeepMind expert in AI and quantum computing, shares insights on the current landscape of quantum technology. She highlights the rise of specialized quantum accelerators designed for AI in sectors like finance and pharma. The main challenge isn’t just hardware: fundamental principles like the no-cloning theorem also create a need for innovative software solutions. Prendki emphasizes the importance of integrating quantum data with classical systems and discusses how quantum computing could change machine learning and data analysis.

Jul 17, 2025 • 38min
From Human-Readable to Machine-Usable: The New API Stack
Sagar Batchu, CEO of Speakeasy, discusses the changes in API development as AI agents become the primary users of APIs. He covers 'vibe coding' for creating adaptable APIs and the challenges of managing multiple APIs. The conversation touches on enhancing AI integration and the significance of tools like Speakeasy in simplifying API interactions. Batchu highlights the importance of multi-cloud platforms and robust security measures, alongside innovations in user experience designed for both technical and non-technical users.

Jul 10, 2025 • 42min
Why Voice Security Is Your Next Big Problem
Yishay Carmiel and Roy Zanbel, co-founders of Apollo Defend, dive into the rapidly evolving landscape of voice AI security. They discuss the alarming implications of voice cloning technology, emphasizing its potential misuse and the urgent need for protective measures. The conversation highlights advancements in human-like speech generation and the complexities of defending against deepfake audio attacks. With voice agents proliferating in customer service, they stress the necessity of robust security measures to safeguard personal authenticity and data privacy.

Jul 3, 2025 • 28min
Unlocking Unstructured Data with LLMs
Shreya Shankar, a PhD student in EECS at UC Berkeley, dives into how Large Language Models (LLMs) are changing the game for unstructured enterprise data. She explains her innovative framework, DocETL, which streamlines semantic extraction and thematic analysis of text and PDFs. The conversation touches on the practical challenges of data extraction and the evolution towards multimodal processing with tools like DocWrangler. Shreya also highlights the importance of aligning user intent with model capabilities for better user experiences.

Jun 26, 2025 • 31min
Building Production-Grade RAG at Scale
Douwe Kiela, Founder and CEO of Contextual AI and an adjunct professor at Stanford, delves into the relevance of Retrieval-Augmented Generation (RAG) amidst evolving AI contexts. He explains the shift to RAG 2.0, emphasizing its potential as an end-to-end trainable system. The conversation highlights the challenges of document understanding, the importance of structured information in extraction, and how hybrid retrieval methods can streamline data access. Douwe also speculates on future advancements in model fine-tuning, emphasizing the need for expert feedback and open-source contributions.