Charles Packer on MemGPT - Weaviate Podcast #73!

Weaviate Podcast

Ideal running speed of a meta model and working with billion-parameter models

This chapter explores the challenges of reaching an ideal running speed with high-parameter-count models, and some solutions, including the benefits of quantized models and of running models in parallel to speed up decoding.
