Chapters
Introduction
00:00 • 3min
The Importance of Smaller Models in Neural Magic
02:39 • 2min
The Cost of Deployment on GPUs and CPUs
04:54 • 3min
The Importance of GPUs for Large Models
07:30 • 2min
How to Optimize a Large Language Model
09:39 • 2min
How to Optimize Your Image for ResNet-50
11:54 • 4min
The Importance of Space and Sparsity in GPUs
15:28 • 3min
The Importance of Intuition in Quantization
18:45 • 5min
The Open Source Community for Sparse Models
23:50 • 2min
How to Optimize a Model in PyTorch
26:11 • 2min
The Importance of Usability in Optimization Platforms
28:00 • 4min
The Future of Machine Learning
32:26 • 6min