The InfoQ Podcast cover image

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

The InfoQ Podcast

00:00

Exploring LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Exploring the characteristics and key components of a state-of-the-art RAG app deployment, highlighting the significance of data pipelines, embedding search, and semantic search.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app