
98 - Analyzing Information Flow In Transformers, With Elena Voita
NLP Highlights
00:00
The Effects of Different Views on the Same Data
In general, frequent tokens change a lot, but influence less. And rare tokens change less and influence more. The intuition is that if you're a frequent token, you don't have much information in yourself. But rare tokens can be sort of hubs for some kind of information, such as religion or politics. So the representation loses information about the current token's identity, but then it manages to recreate it in the upper layers, because it accumulates information from the other tokens in the sentence.