The InfoQ Podcast cover image

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

The InfoQ Podcast

00:00

Intro

Exploring the significance of deploying large language models in regulated industries like deploying generative AI within on-premises or VPC infrastructure, featuring the insights and journey of Titan ML's co-founder and CEO in bridging research and enterprise to tackle AI infrastructure challenges.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app