What Are the Major Ingredients of Transformers?

RNNs and LSTMs are taught as kind of like this is maybe a historical artifact in deep learning courses. Do you think there are insights remaining to be gleamed from RNNs and these earlier architectures, despite the fact that NLP now seems to be all about transformers? Oh yeah, absolutely. And it seems you were right. I do have a few questions about this set of papers. The first is just thinking through the evolution of things over time. After attention got taken forward and really taken to its logical endpoint with the development of the transformer. But as entities in themselves, at least from my perspective, I don't see nearly as much attention being paid to them

Play episode from 01:01:18

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app