
Philip Kiely
Head of Developer Relations at Baseten, an AI infrastructure platform that helps companies deploy AI models. Kiely is an expert in AI transcription and the Whisper model.
Top 3 podcasts with Philip Kiely
Ranked by the Snipd community

36 snips
Oct 17, 2024 • 57min
Compound AI Systems with Philip Kiely - Weaviate Podcast #105!
Philip Kiely, the leading developer relations at Baseten, shares insights on compound AI systems and their evolution. He discusses breaking tasks into multiple stages for better AI model performance. The conversation covers advancements in multimodal AI and strategies for deploying these systems efficiently. Kiely emphasizes the benefits of smaller models and constrained generation techniques, alongside architectural tips for Kubernetes deployment. Key comparisons are made between various model serving frameworks, focusing on optimizing AI performance while minimizing costs.

31 snips
Oct 28, 2024 • 58min
Running Generative AI Models In Production
Philip Kiely, an AI infrastructure expert at BaseTen, dives into the complexities of running generative AI models in production. He shares insights on the importance of selecting the right model based on product requirements and discusses key deployment strategies, including architecture and performance monitoring. Challenges like model quantization and the balance between open-source and proprietary models are explored. Philip also highlights future trends such as local inference, emphasizing the need for compliance in sectors like healthcare.

Dec 3, 2025 • 57min
SE Radio 697: Philip Kiely on Multi-Model AI
Philip Kiely, the software developer relations lead at BaseTen, dives into the realm of multi-agent AI. He advocates for building AI-native products through the composition of multiple models and agents that take action, moving beyond simple ChatGPT interfaces. Kiely highlights the shift to custom solutions driven by domain-specific needs and economic considerations. He emphasizes the importance of safety, trust, and iterative experimentation in AI engineering while discussing practical applications like a D&D assistant evolving into a multimodal agent.


