Episode 33: Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference

Generally Intelligent

The Opposition to Attention

For the people who are skeptical of this direction, what are their objections? One objection is that attention works fine for the current models. But when you start increasing the sequence length, which matters for lots of applications, attention starts to become unwieldy or becomes a bottleneck again. So we just took a different path, which is: what if we try other alternatives that could also work quite well?
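
As a rough illustration of why attention becomes the bottleneck as sequence length grows (a minimal sketch, not from the episode; the function name and sizes below are illustrative): standard attention materializes an N x N score matrix, so memory and compute scale quadratically with the sequence length N.

    # Minimal sketch: naive attention builds an N x N score matrix,
    # so memory and compute grow quadratically with sequence length N.
    import torch

    def naive_attention(q, k, v):
        # q, k, v: (batch, seq_len, head_dim)
        d = q.shape[-1]
        scores = q @ k.transpose(-2, -1) / d ** 0.5  # (batch, N, N): O(N^2) memory
        weights = torch.softmax(scores, dim=-1)
        return weights @ v

    for n in (1_024, 4_096, 16_384):
        # Bytes for a single fp16 N x N score matrix, per head
        print(f"N={n:>6}: score matrix ~ {2 * n * n / 1e9:.2f} GB per head (fp16)")
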
