Super Data Science: ML & AI Podcast with Jon Krohn cover image

939: Mixture-of-Experts and State-Space Models on Edge Devices, with Tyler Cox and Shirish Gupta

Super Data Science: ML & AI Podcast with Jon Krohn

00:00

Mamba and Hybrid Architectures for Long Contexts

Jon asks how Mamba improves scaling and Tyler explains linear context scaling, memory savings, and Mamba's role in Granite 4H.

Play episode from 42:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app