Software Engineering Radio - the podcast for professional software developers cover image

SE Radio 697: Philip Kiely on Multi-Model AI

Software Engineering Radio - the podcast for professional software developers

00:00

Inference engineering and runtime configuration

Philip emphasizes dynamic runtime configs, quantization, batching, and speculation as key levers in inference engineering.

Play episode from 44:32
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app