The Ruby AI Podcast cover image

Running Self-Hosted Models with Ruby and Chris Hasinski

The Ruby AI Podcast

00:00

Monitoring locally hosted models and APM needs

Chris discusses monitoring LLM latency, time-to-first-token, and treating models like database bottlenecks for observability.

Play episode from 29:03
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app