AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Transformers vs a G C?
In transformers, it is explicitly a quadratic term. And you achieve the permutation a in variance by looking at all the c tupples of the items in the setwar cicles too,. So you can think of transformers as being a graft learning problem. There might be intere ing a lessons we can learn from transformers that will make better genan architectures.