Spring Office Hours cover image

S5E02 - Topic Smorgasbord

Spring Office Hours

00:00

Prompt caching to save token costs

Dan outlines caching system prompts, tool lists, and other stable payloads to reduce token usage in high-throughput apps.

Play episode from 37:55
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app