MLOps.community  cover image

LLMs at Scale: Infrastructure That Keeps AI Safe, Smart & Affordable // Marco Palladino// # 341

MLOps.community

00:00

Universal model API and routing for cost and latency

Marco describes Kong's universal API that lets teams switch vendors, route by prompt complexity, and optimize cost and latency.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app