
The InfoQ Podcast
Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack
Jun 10, 2024
Meryem Arik, Co-founder/CEO at TitanML, talks about the latest trends in generative AI and Large Language Model (LLM) technologies. She discusses LLM Deployment, state-of-the-art Retrieval Augmented Generation (RAG) apps, and the inference architecture stack for LLM applications. The conversation also touches on advancements in LLM technology, industry adoption, tips for LLM deployment, and the importance of AI regulation.
37:56
Podcast summary created with Snipd AI
Quick takeaways
- Deployment of generative AI in regulated industries emphasizes on-prem or VPC solutions.
- Recent rapid innovation in generative AI includes smaller models that match GPT-3.5 performance, as well as GPT-4o's multimodal abilities.
Deep dives
Generative AI Solutions in Regulated Industries
Meryem Arik, Co-founder and CEO of TitanML, discusses deploying generative AI, and LLMs in particular, in regulated industries. TitanML helps these organizations deploy generative AI within their own infrastructure, emphasizing on-prem or VPC deployment to overcome the infrastructure challenges enterprises face.