AI + a16z cover image

Inferact: Building the Infrastructure That Runs Modern AI

AI + a16z

00:00

Origins of vLLM and the research spark

Woosuk and Simon recount how vLLM began as a Berkeley research prototype to speed open-weight LLM demos.

Play episode from 02:35
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app