Tuhin, an expert in model deployment and monitoring at any scale, joins the show to discuss self-hosting open access models. They explore trends in tooling and usage of open access models, common use cases for integrating self-hosted models, and how the boom in generative AI has influenced the ecosystem.
Podcast summary created with Snipd AI
Quick takeaways
Baseten offers a platform that simplifies model deployment with workflow management features.
Baseten focuses on scalability, data privacy, and multi-cloud deployment to meet diverse user needs.
Deep dives
Baseten: Building Infrastructure for ML
Baseten hosts machine learning models in a production-ready, scalable way. Their platform abstracts away the infrastructure complexity of running models, letting engineers and data scientists focus on their core work. Baseten maintains an open-source library called Truss, which lets users package a model and deploy it with just a few lines of code. The platform also provides workflow management features such as versioning, A/B testing, observability, and logging, making it easier for teams to iterate on and manage models in production.
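The "package a model and deploy it" workflow described above can be sketched roughly as follows. This is a hypothetical, simplified stand-in for a Truss-style model class with one-time setup and per-request hooks; the real library's exact signatures and wiring may differ.

```python
# Hypothetical sketch of a Truss-style model package: a class with a
# `load` hook (one-time setup) and a `predict` hook (per-request
# inference). Names and signatures are illustrative, not the official API.

class Model:
    def __init__(self):
        self._model = None

    def load(self):
        # One-time initialization: load weights, warm caches, etc.
        # A trivial callable stands in for a real model here.
        self._model = lambda text: text.upper()

    def predict(self, model_input: dict) -> dict:
        # Per-request inference; a serving platform would wrap this
        # method in an HTTP endpoint, batching, and autoscaling.
        result = self._model(model_input["text"])
        return {"output": result}
```

In this shape, the platform owns the server, scaling, and observability, while the user only supplies the two hooks.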
Challenges of Running ML Models in Production
Running machine learning models in production comes with challenges around latency, throughput, cost, and security. Baseten addresses these with a scalable service that handles variable traffic and delivers low-latency production inference. They also prioritize data privacy and security, letting users deploy models inside their own VPCs or AWS accounts to retain control and ownership of their data. Baseten simplifies the deployment workflow with version management, observability, and logging features that streamline the process and boost productivity.
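To make the latency and throughput concerns above concrete, here is a small client-side sketch (pure standard library; the names are illustrative assumptions, not part of any platform's API) that measures both for an arbitrary inference callable:

```python
import time
from statistics import median

def measure(predict, inputs):
    """Measure per-request latency and overall throughput for an
    inference callable -- the kind of numbers a serving platform
    must keep healthy under variable traffic."""
    latencies = []
    start = time.perf_counter()
    for x in inputs:
        t0 = time.perf_counter()
        predict(x)  # one inference request
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start
    return {
        "p50_s": median(latencies),              # median latency, seconds
        "throughput_rps": len(inputs) / elapsed,  # requests per second
    }
```

Running such a probe against a deployed endpoint is a cheap way to sanity-check the latency/throughput trade-offs discussed here before committing to a serving setup.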
Expanding Infrastructure Support and Multi-Cloud Deployment
Baseten is continuously expanding its infrastructure support to accommodate various frameworks and runtime environments. They are actively working on compatibility with frameworks like TRTorch, ONNX, and TFLite, allowing users to bring their own frameworks and containers to run models. They are also releasing a multi-cluster feature that lets users deploy models across multiple cloud providers, bringing flexibility and control to enterprise customers. This multi-cloud approach caters to the growing demand for self-hosted solutions and opens new opportunities for scalability and performance optimization.
Future of Baseten: Fine-tuning and Dataset Collection
Looking ahead, Baseten plans to invest in fine-tuning and dataset collection. They see fine-tuning as crucial for giving users more control over and customization of pre-trained models. By aligning with OpenAI-style endpoints, they aim to collect datasets and produce more fine-tuned models for their users, opening up possibilities for improved model performance and personalized AI experiences. Baseten sees significant opportunities for individuals and companies to build tools that support the evolving AI and ML landscape.
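"Aligning with OpenAI endpoints" generally means exposing a self-hosted model behind the same request shape as OpenAI's chat completions API, so existing clients work unchanged. Here is a hedged, standard-library sketch of building such a request; the base URL and model name are placeholders, not real endpoints.

```python
import json
import urllib.request

def build_chat_request(base_url: str, model: str, prompt: str):
    """Build an OpenAI-style chat-completions request aimed at a
    self-hosted endpoint. `base_url` is a placeholder -- point it at
    wherever your model is actually served."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Because the wire format matches what OpenAI clients already emit, a provider that speaks it can also log the traffic as fine-tuning data, which is the dataset-collection angle mentioned above.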
We’re excited to have Tuhin join us on the show once again to talk about self-hosting open access models. Tuhin’s company Baseten specializes in model deployment and monitoring at any scale, and it was a privilege to talk with him about the trends he is seeing in both tooling and usage of open access models. We were able to touch on the common use cases for integrating self-hosted models and how the boom in generative AI has influenced that ecosystem.
Changelog++ members save 1 minute on this episode because they made the ads disappear. Join today!
Sponsors:
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster!