The Data Exchange with Ben Lorica

The Infrastructure for Production AI

Oct 9, 2025

INSIGHT

Hardware-Software Co-Design Matters

An AI-first cloud requires tight integration of hardware and software to serve compute-bound, high-bandwidth workloads.
Designing from first principles enables smarter caching and distributed sharding across storage and memory layers.
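
The snip mentions distributed sharding but does not say how shards are placed. As one illustration only, the sketch below uses rendezvous (highest-random-weight) hashing, a swapped-in technique that the episode does not name. The node names, the shard key, and the `place_shard` helper are all hypothetical.

```python
import hashlib


def _score(node: str, key: str) -> int:
    """Deterministic pseudo-random weight for a (node, key) pair."""
    digest = hashlib.sha256(f"{node}:{key}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


def place_shard(key: str, nodes: list[str], replicas: int = 1) -> list[str]:
    """Return the `replicas` nodes with the highest score for this key.

    Assumption: rendezvous hashing stands in for whatever placement
    scheme a real system uses; it is not taken from the episode.
    """
    ranked = sorted(nodes, key=lambda n: _score(n, key), reverse=True)
    return ranked[:replicas]


# Hypothetical fleet and shard key, for illustration only.
nodes = ["node-a", "node-b", "node-c", "node-d"]
owners = place_shard("model.layer.17", nodes, replicas=2)
```

A property that makes this scheme attractive for a changing fleet: removing a node that does not own a shard leaves that shard's owners unchanged, so only the data on the departed node has to move.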

ADVICE

Cache Closer To The Metal

Cache below the bucket or block storage layer, down to shared memory, to accelerate large-model workloads.
Gain low-level hardware access to implement distributed smart caching across a global fleet.
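
To make the idea of caching below the storage layer concrete, here is a minimal sketch of a tiered lookup: memory first, then a local tier, then remote storage. It is a toy model, not the system discussed in the episode. The `TieredCache` class, its plain-dict "disk" tier, and the `fetch_remote` callback are assumptions made for illustration.

```python
from collections import OrderedDict
from typing import Callable


class TieredCache:
    """Toy three-tier cache: memory (LRU) -> local tier -> remote fetch."""

    def __init__(self, fetch_remote: Callable[[str], bytes], memory_slots: int = 2):
        self.memory: OrderedDict[str, bytes] = OrderedDict()  # fastest, smallest
        self.local_disk: dict[str, bytes] = {}  # stand-in for a node-local tier
        self.fetch_remote = fetch_remote        # stand-in for bucket/block storage
        self.memory_slots = memory_slots
        self.hits = {"memory": 0, "disk": 0, "remote": 0}

    def get(self, key: str) -> bytes:
        if key in self.memory:
            self.memory.move_to_end(key)  # mark as most recently used
            self.hits["memory"] += 1
            return self.memory[key]
        if key in self.local_disk:
            self.hits["disk"] += 1
            value = self.local_disk[key]
        else:
            self.hits["remote"] += 1
            value = self.fetch_remote(key)
            self.local_disk[key] = value  # keep a copy closer to the compute
        self._promote(key, value)
        return value

    def _promote(self, key: str, value: bytes) -> None:
        self.memory[key] = value
        self.memory.move_to_end(key)
        while len(self.memory) > self.memory_slots:
            self.memory.popitem(last=False)  # evict least recently used


# Hypothetical usage: the remote fetch just echoes the key as bytes.
cache = TieredCache(fetch_remote=lambda k: k.encode(), memory_slots=2)
for key in ["a", "a", "b", "c", "a"]:
    cache.get(key)
```

In this run the final read of "a" is served from the local tier rather than remote storage, because it was evicted from memory but never left the node.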

INSIGHT

Production Reveals Infra Gaps

Production and longer training runs reveal reliability and operational gaps that short experiments hide.
Self-healing, health checks, and orchestration are critical as runs lengthen and faults become inevitable.
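
The pattern described above, a health check before each unit of work plus automatic recovery, can be sketched as a loop that rolls back to the last checkpoint when a check fails. This is a simplified illustration under stated assumptions: `run_with_self_healing`, the injected fault, and the step counts are hypothetical and do not come from the episode.

```python
from typing import Callable


def run_with_self_healing(
    total_steps: int,
    train_step: Callable[[int], None],
    is_healthy: Callable[[int], bool],
    checkpoint_every: int = 10,
    max_restarts: int = 3,
) -> int:
    """Run steps in order; on a failed health check, resume from the last checkpoint.

    Returns the number of restarts that were needed.
    """
    checkpoint = 0  # last step boundary known to be safely saved
    restarts = 0
    step = 0
    while step < total_steps:
        if not is_healthy(step):
            restarts += 1
            if restarts > max_restarts:
                raise RuntimeError("too many restarts; escalate to an operator")
            step = checkpoint  # roll back and redo the work since the checkpoint
            continue
        train_step(step)
        step += 1
        if step % checkpoint_every == 0:
            checkpoint = step
    return restarts


# Hypothetical fault injection: the node fails its health check once, at step 25.
pending_faults = {25}
executed: list[int] = []


def is_healthy(step: int) -> bool:
    if step in pending_faults:
        pending_faults.discard(step)
        return False
    return True


restarts = run_with_self_healing(30, executed.append, is_healthy)
```

The cost of a fault here is the work since the last checkpoint (steps 20 to 24 are repeated), which is why checkpoint frequency matters more as runs get longer.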