Deep Papers cover image

Hungry Hungry Hippos - H3

Deep Papers

CHAPTER

Can SSMs Do More Than Transformers?

SSM has a property they can compute it in almost linear time. So just from the kind of a computational standpoint, you can just use much longer sequence. And that's one example of something that SSMs have shown a lot more power over than transformers. Jason: Can you give me an example of a long range task that in SSM would do much better than a transformer?

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner