The Data Stack Show

258: Confidently Wrong: Why AI Needs Tools (and So Do We)

Aug 20, 2025
This discussion dives into AI's latest advancements, especially the evolution of GPT models and the importance of reliable tools to minimize hallucinations. It contrasts data warehouses with agent-based approaches, addressing the risks involved. The conversation features engaging analogies from poker and football, emphasizing the art of risk-taking in data teams. Notably, the hosts advocate for balancing innovation with risk management and highlight the value of fiction in understanding human behavior—all while navigating the complexities of real-time data processing.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Tool-First Models Reduce Hallucinations

  • Open-source LLMs may hallucinate frequently but are designed to call deterministic tools to compensate.
  • Tool-first models trade internal factuality for external determinism to improve real-world usefulness.
INSIGHT

LLMs As Interfaces To Deterministic Tools

  • Combining LLMs with deterministic tools makes agents more reliable and controllable.
  • The LLM becomes an interface while the tools perform precise, deterministic work.
INSIGHT

No One-Size Data Architecture

  • The warehouse vs agent debate will hinge on use case, data volume, and team preference.
  • Large-scale, complex joins and transformations still favor centralized warehouses or lakehouses today.
Get the Snipd Podcast app to discover more snips from this episode
Get the app