

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes
Mentioned books

11 snips
Dec 18, 2025 • 40min
The Developer’s Guide to LLM Security
Steve Wilson, Chief AI and Product Officer at Exabeam, dives into the complexities of securing Large Language Models and agent workflows. He highlights the unique risks of prompt injection and supply chain vulnerabilities that arise with democratized AI tools. Wilson discusses the importance of guardrails, the dangers of excessive agent authority, and lessons learned from web security mishaps. He also explores the concept of citizen developers and advocates for the OWASP GenAI Security Project to provide rapid community-driven guidance for safer AI practices.

13 snips
Dec 13, 2025 • 44min
Is AI a Utility? Defining Usability and Public Trust
Evangelos Simoudis, a venture investor and corporate innovation expert at Synapse Partners, joins to discuss the dual-edged sword of AI in the workforce. They examine AI-driven layoffs, emphasizing the importance of investment in upskilling and R&D. The conversation highlights how trust in AI outputs can be fragile due to unpredictable behaviors and the need for better government coordination on AI access. They also delve into legal complexities surrounding platform liability as AI systems take on more editorial roles in content creation.

10 snips
Dec 11, 2025 • 30min
How to Build AI Copilots That Teach Rather Than Automate
Stefania Druga, an independent researcher and former Google DeepMind scientist, delves into creating AI tools for young learners. She shares insights on how children's natural curiosity informs better AI design. Stefania champions the Socratic method for teaching, highlighting her work on Cognimates as a supportive learning copilot. They discuss the importance of multimodal interfaces and the challenges of current AI education tools. Listen in as she reveals how real-time apps like MathMind address misconceptions in math, pushing for innovative solutions in AI education.

22 snips
Dec 4, 2025 • 48min
The AI Revolution Finally Comes to Structured Data
Jure Leskovec, a Stanford professor and co-founder of Kumo.ai, dives into the transformative power of relational foundation models for structured enterprise data. He challenges the current limitations of AI in handling relational data, emphasizing the shortcomings of treating tabular data as text. Jure outlines Kumo’s rapid predictive SQL-like language, innovative graph representations, and the model's ability to handle messy data effectively. He also discusses real-world successes like DoorDash's significant improvements and the potential applications of these models across various industries.

10 snips
Nov 26, 2025 • 48min
Building the Knowledge Layer Your Agents Need
Philip Rathle, CTO of Neo4j and a leading expert in graph technologies, explores the integration of knowledge graphs in enterprise AI. He discusses the real-world application of GraphRAG, detailing how it enhances context for AI agents. Rathle highlights successful enterprises using this technology and warns against overly complex projects. He also showcases tools like the LLM Graph Builder for building starter knowledge graphs and emphasizes the need for clear governance and determinism in AI systems, ultimately illustrating how graphs can significantly improve AI reasoning.

28 snips
Nov 20, 2025 • 26min
How Language Models Actually Think
Emmanuel Ameisen, an interpretability researcher at Anthropic and author, dives into the workings of large language models. He explains how these models can resemble biological systems and reveals surprising problem-solving patterns, like predicting multiple tokens at once. Emmanuel also addresses the misleading nature of reasoning outputs and the neural mechanics behind hallucinations. He emphasizes the importance of model calibration, debugging tools, and even shares practical advice for developers. It's a fascinating look at the complexity of AI behavior!

8 snips
Nov 15, 2025 • 33min
How AI Is Reshaping Jobs, Budgets, and Data Centers
Evangelos Simoudis, a venture investor and corporate innovation expert at Synapse Partners, joins the discussion to unpack how AI is reshaping the workforce and corporate strategies. He explains the complexities behind AI-driven layoffs, categorizing them into untrainability and automation. They also explore the need for ROI in massive capital investments in AI, while discussing how LLMOps can manage financial efficiencies. Evangelos emphasizes the importance of cross-functional teams for successful AI integration, highlighting the evolving landscape of AI as a utility.

12 snips
Nov 13, 2025 • 50min
Making Data Engineering Safe for Automation and Agents
Ciro Greco, Co-founder and CEO of Bauplan, discusses revolutionizing data engineering by applying software principles like version control and transactional pipelines to data lakes. He highlights the unique challenges of data work, such as scale and fragmentation, and introduces a git-like branching model for enhanced reproducibility. Ciro emphasizes the importance of transactional guarantees, especially for automated agents, and advocates for a code-first approach to enable safe and efficient interactions with data platforms.

13 snips
Nov 6, 2025 • 56min
Is Your Database Ready for an Army of AI Agents?
Mike Freedman and Ajay Kulkarni, co-founders of Tiger Data, discuss their innovative creation, Agentic Postgres, a database tailored for AI agents. They explain how traditional Postgres struggles with search and scalability when used by agents and introduce concepts like Fluid Storage for instant database forks. The conversation covers the importance of developer-friendly features and the unique capabilities of their MCP server, which enhances model training while ensuring security. They also touch on using Postgres for unstructured data and the future of database tooling.

Oct 30, 2025 • 46min
Beyond the Dashboard: Collaborative Analytics in Slack
Nick Schrock, CTO and founder of Dagster, explores the transformative power of data orchestration in the age of AI. He introduces Compass, a Slack-native tool designed for collaborative data analysis, aiming to replace inefficient ad-hoc dashboards. Schrock explains the concept of context pipelines, crucial for AI strategies, and highlights how Compass improves user interaction and onboard processes. He also discusses the synergy of automation and human contributions, making data workflows more efficient and integrated in team environments.


