Weaviate Podcast cover image

Charles Packer on MemGPT - Weaviate Podcast #73!

Weaviate Podcast

CHAPTER

Ideal running speed of a meta model and working with billion parameter models

This chapter explores the challenges and solutions for achieving the ideal running speed in high parameter models, including the benefits of using quantized models and running parallel models for faster decoding speed.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner