5min chapter

Machine Learning Street Talk (MLST) cover image

#77 - Vitaliy Chiley (Cerebras)

Machine Learning Street Talk (MLST)

CHAPTER

The Problem With Sparsityn in Activations and Wedg

The Serab architecture allows you to come iteratively sparsified in your network. But, i don't think it's necessarily zero, which means something is sparse. The whole point of this thing is to kind of dramatically improve the memory complexity of the network. If one of those guys is still huge, then does that the cash value of that? Yes. And a, to base off of what we've been talking about, what we'd heard,. i'd highly recommend other people consider working there too.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode