Kyunghyun Cho: Neural Machine Translation, Language, and Doing Good Science

The Gradient: Perspectives on AI

CHAPTER

What Are the Major Ingredients of Transformers?

RNNs and LSTMs are taught in deep learning courses as kind of a historical artifact. Do you think there are insights remaining to be gleaned from RNNs and these earlier architectures, despite the fact that NLP now seems to be all about transformers? Oh yeah, absolutely. And it seems you were right. I do have a few questions about this set of papers. The first is just thinking through the evolution of things over time. After attention got taken forward and really taken to its logical endpoint with the development of the transformer. But as entities in themselves, at least from my perspective, I don't see nearly as much attention being paid to them.
