The Introduction of Transformers as Programmable Computers

In the talk you gave at the Alan Turing Institute, you expounded on how RNNs relate to the models of computation we're familiar with. You made the argument that, in the hierarchy, they lie closer to finite state machines, which don't necessarily have access to unbounded memory, than to Turing machines. How do you think about the introduction of transformers in terms of that computational hierarchy question?

That's an interesting question to ask, specifically because one of my students, Minqi Jiang, sent me a paper just the other day on this topic. It's called Looped Transformers as Programmable Computers, by Angeliki Giannou and colleagues.
