The Tech Strategy Podcast

What is the Cost of Maintaining the Correctness of a GenAI Service? (268)

Nov 25, 2025
The discussion covers the costs of maintaining correctness in generative AI services and how correctness shapes AI economics and product strategy. Jeff contrasts modern AI data centers with traditional ones and explores the architectural implications for cost. You'll hear about the trade-offs in compute costs between self-hosted models and APIs. The required level of accuracy varies significantly across applications, as do the costs of human involvement and ongoing maintenance. The episode also examines how correctness requirements evolve over time.
INSIGHT

Correctness Drives AI Economics

  • Generative AI changes core economics because maintaining correctness becomes central to costs and strategy.
  • Correctness can shift a product from high-margin software to low-margin service-like economics.
INSIGHT

Foundation Models Add A New Cost Layer

  • Foundation models add a fourth architectural component, the model, alongside the app, database, and compute.
  • That extra component fundamentally alters cost structure and competitive dynamics.
ADVICE

Account For Initial And Ongoing Compute Costs

  • Calculate compute costs including energy, cooling, and cloud provider bills for both initial build and ongoing inference.
  • Choose between self-hosting a downloaded model or using cloud APIs based on upfront versus recurring cost tradeoffs.
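The upfront versus recurring trade-off in the advice above can be sketched as a simple break-even calculation. This is a minimal illustration, not a method from the episode: the class names, the cost categories, and every number in the usage note are hypothetical assumptions, and a real comparison would also need to cover staffing, utilization, and price changes.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SelfHosted:
    """Hypothetical cost profile for running a downloaded model yourself."""
    upfront_cost: float          # assumed one-time spend: hardware, setup
    monthly_running_cost: float  # assumed recurring spend: energy, cooling, hosting


@dataclass
class ApiHosted:
    """Hypothetical cost profile for calling a cloud model API."""
    cost_per_1k_requests: float  # assumed flat usage price


def monthly_api_cost(api: ApiHosted, requests_per_month: int) -> float:
    # Recurring API bill scales linearly with usage in this simplified model.
    return api.cost_per_1k_requests * requests_per_month / 1000


def cumulative_self_hosted_cost(hosted: SelfHosted, months: float) -> float:
    # Upfront spend plus recurring running costs over the period.
    return hosted.upfront_cost + hosted.monthly_running_cost * months


def break_even_months(
    hosted: SelfHosted, api: ApiHosted, requests_per_month: int
) -> Optional[float]:
    """Months until self-hosting becomes cheaper than the API.

    Returns None when the API is cheaper every month, so the upfront
    cost is never recovered at this usage level.
    """
    monthly_saving = monthly_api_cost(api, requests_per_month) - hosted.monthly_running_cost
    if monthly_saving <= 0:
        return None
    return hosted.upfront_cost / monthly_saving
```

With illustrative inputs such as `SelfHosted(120_000, 5_000)` and `ApiHosted(2.0)`, calling `break_even_months` at 10 million requests per month gives a payback period in months, while at 1 million requests per month it returns `None` because the API bill stays below the self-hosted running cost. The point of the sketch is that the choice depends heavily on expected volume.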