The InfoQ Podcast cover image

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

The InfoQ Podcast

CHAPTER

Intro

Exploring the significance of deploying large language models in regulated industries like deploying generative AI within on-premises or VPC infrastructure, featuring the insights and journey of Titan ML's co-founder and CEO in bridging research and enterprise to tackle AI infrastructure challenges.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner