
The Tech Strategy Podcast
What is the Cost of Maintaining the Correctness of a GenAI Service? (268)
Nov 25, 2025
The discussion examines the costs of maintaining correctness in generative AI services and how correctness shapes AI economics and product strategy. Jeff contrasts modern AI data centers with traditional ones and explores what the architectural differences mean for cost. The episode covers the trade-offs between compute costs for self-hosted models and for APIs. The level of accuracy required varies significantly across applications, as do the need for human involvement and the ongoing maintenance costs. The episode also examines how correctness requirements evolve over time.
AI Snips
Correctness Drives AI Economics
- Generative AI changes core economics because maintaining correctness becomes central to costs and strategy.
- Correctness can shift a product from high-margin software to low-margin service-like economics.
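To make the margin point concrete, here is a minimal sketch of how a correctness cost can eat into per-request gross margin. It assumes correctness is maintained by sending a share of outputs to human review, which is only one possible mechanism. The function name, the parameters, and all numbers are hypothetical illustrations, not figures from the episode.

```python
def gross_margin(price, compute_cost, review_rate, review_cost):
    """Per-request gross margin as a fraction of price.

    price        -- revenue per request (hypothetical)
    compute_cost -- inference cost per request (hypothetical)
    review_rate  -- share of outputs checked by a human, 0.0 to 1.0 (assumption)
    review_cost  -- cost of one human review (hypothetical)
    """
    cost = compute_cost + review_rate * review_cost
    return (price - cost) / price


# Illustrative numbers only: the same product with and without human review.
software_like = gross_margin(price=1.00, compute_cost=0.10, review_rate=0.0, review_cost=2.00)
service_like = gross_margin(price=1.00, compute_cost=0.10, review_rate=0.25, review_cost=2.00)

print(f"no review:  {software_like:.0%}")
print(f"25% review: {service_like:.0%}")
```

Under these made-up inputs, reviewing a quarter of outputs moves the margin from software-like territory to something closer to a services business, which is the shift the snip describes.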
Foundation Models Add A New Cost Layer
- Foundation models add a fourth architectural component, the model, alongside the app, database, and compute.
- That extra component fundamentally alters cost structure and competitive dynamics.
Account For Initial And Ongoing Compute Costs
- Calculate compute costs including energy, cooling, and cloud provider bills for both initial build and ongoing inference.
- Choose between self-hosting a downloaded model or using cloud APIs based on upfront versus recurring cost tradeoffs.
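The upfront versus recurring tradeoff can be sketched as a simple break-even calculation. This is a rough illustration under stated assumptions: self-hosting is modeled as an upfront cost amortized evenly over a fixed number of months plus a flat monthly running cost (energy, cooling, operations), and the API is modeled as a flat price per 1,000 requests. All function names and numbers are hypothetical.

```python
def self_hosted_monthly_cost(upfront, amortization_months, monthly_running):
    """Monthly cost of self-hosting: amortized upfront spend plus running costs."""
    return upfront / amortization_months + monthly_running


def api_monthly_cost(requests_per_month, price_per_1k_requests):
    """Monthly cost of a usage-priced API (flat price per 1,000 requests assumed)."""
    return requests_per_month / 1000 * price_per_1k_requests


def break_even_requests(upfront, amortization_months, monthly_running, price_per_1k_requests):
    """Monthly request volume at which both options cost the same."""
    fixed = self_hosted_monthly_cost(upfront, amortization_months, monthly_running)
    return fixed / price_per_1k_requests * 1000


# Illustrative numbers only, not figures from the episode.
volume = break_even_requests(
    upfront=120_000,            # hardware and setup (hypothetical)
    amortization_months=24,     # assumed useful life
    monthly_running=3_000,      # energy, cooling, operations (hypothetical)
    price_per_1k_requests=2.0,  # API price (hypothetical)
)
print(f"break-even volume: {volume:,.0f} requests per month")
```

Below the break-even volume the API is cheaper in this model; above it, self-hosting is. A real comparison would also need to account for utilization, model updates, and the staff needed to run the hardware.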


