The a16z Show cover image

Inferact: Building the Infrastructure That Runs Modern AI

The a16z Show

00:00

Global deployment and diversity

Simon shares vLLM usage scale, diverse GPU architectures, and why one-size-fits-all solutions don't work.

Play episode from 25:19
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app