Large models on CPUs

Practical AI

Exploring Sparsity in Large Models

This chapter explores sparsity in large models and how up to 95% of a model's parameters can be removed without sacrificing performance. It also discusses the benefits of optimizing large language models for speed and efficiency, particularly when moving from GPU to CPU processing.
