27min chapter


Running Generative AI Models In Production

AI Engineering Podcast

CHAPTER

Deploying Generative AI: Architecture and Insights

This chapter covers the complexities of deploying generative AI models, particularly choosing an appropriate application architecture. It highlights frameworks and approaches such as retrieval-augmented generation and quantization, and emphasizes model performance and reliability in production. It also discusses monitoring, addressing concept drift, and keeping models relevant as real-world conditions change.

