The Data Exchange with Ben Lorica

The Infrastructure for Production AI

Oct 9, 2025
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Hardware-Software Co-Design Matters

  • AI-first cloud requires tight integration of hardware and software to serve compute-bound, high-bandwidth workloads.
  • Designing from first principles enables smarter caching and distributed sharding across storage and memory layers.
ADVICE

Cache Closer To The Metal

  • Cache at lower levels than bucket or block storage, including shared memory, to accelerate large model workloads.
  • Gain low-level hardware access to implement distributed smart caching across a global fleet.
INSIGHT

Production Reveals Infra Gaps

  • Production and longer training runs reveal reliability and operational gaps that short experiments hide.
  • Self-healing, health checks, and orchestration are critical as runs lengthen and faults become inevitable.
Get the Snipd Podcast app to discover more snips from this episode
Get the app