Yannic Kilcher Videos (Audio Only) cover image

How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit

Yannic Kilcher Videos (Audio Only)

00:00

Is There a Type of Sparse Model for Neural Network Execution?

In a neural network, there are overlaps between columns. How do you deal with the overlaps in a way that doesn't kill your computation? That's the magic of it. There's an algorithm that allows you to do that. And because you can do it, you manage to run this way and you don't hit this memory bottleneck and boom, you're in business. Yeah. So for GPU, it's almost like, you know, GPUs enable us to do dense models. But I think also models have almost co-evolved with theGPU. People have started building models to fit the NVIDIA architectures better. Especially something like a transformer is like, that's it. That

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app