
Hungry Hungry Hippos - H3
Deep Papers
Using Multiplier Interactions in RNN Modeling
The way that we, the physical mechanism that we use, it's called a multiplicative gate. All that means is that we take the outputs of these two hippos and we multiply them. So now we can bring out Obama as the, as the next word. And then the second hippo that is remembering words through the whole sequence,. Then you get to Michelle, and then you can say, oh, there's a president that appeared some time ago. I think this gating idea actually goes back a long time to even in RNN things like long term, long short term memory LSTM.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.