AI Engineering Podcast cover image

AI Engineering Podcast

Running Generative AI Models In Production

Oct 28, 2024
57:37
Snipd AI
Philip Kiely, an AI infrastructure expert at BaseTen, dives into the complexities of running generative AI models in production. He shares insights on the importance of selecting the right model based on product requirements and discusses key deployment strategies, including architecture and performance monitoring. Challenges like model quantization and the balance between open-source and proprietary models are explored. Philip also highlights future trends such as local inference, emphasizing the need for compliance in sectors like healthcare.
Read more

Podcast summary created with Snipd AI

Quick takeaways

  • Understanding product strategy is crucial, as it influences model selection and the overall approach to AI deployment.
  • The architectural evolution of AI applications, particularly through compound methods, presents increased complexity in orchestration and inference.

Deep dives

Understanding Open Models

The concept of open models is defined through parameters like open weights, data, and code, which facilitate transparency in AI development. True open models must meet stringent criteria that allow users to access the full process of model creation and deployment. However, practical interpretations can vary, with models such as Meta's Llama challenging the traditional definitions by utilizing custom licenses. As more companies attempt to balance openness with proprietary interests, the industry must collaboratively navigate the complexities of licensing while ensuring a fertile environment for innovation.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode