The InfoQ Podcast cover image

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

The InfoQ Podcast

CHAPTER

Exploring LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Exploring the characteristics and key components of a state-of-the-art RAG app deployment, highlighting the significance of data pipelines, embedding search, and semantic search.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner