

Self-hosting & scaling models (Practical AI #243)
Oct 31, 2023
Tuhin, an expert in model deployment and monitoring at any scale, joins the show to discuss self-hosting open access models. They explore trends in tooling and usage of open access models, common use cases for integrating self-hosted models, and how the boom in generative AI has influenced the ecosystem.
Chapters
Introduction
00:00 • 2min
Introduction and Advancements in ML and AI
01:32 • 28min
Trends in Model Deployments and the Future of Running Models on Devices
29:05 • 4min
Challenges of Incorporating Large Language Models in an Air-Gapped Edge Environment
33:09 • 2min
Exploring the Future of Model Hosting Infrastructure
35:26 • 3min
The Significance of Caching and Multi-Cloud Environments for Model Deployment
38:28 • 3min