AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Large Language Models Are in Context Learners
In the original paper on GPT three, they said large language models are in context learners. And that would be referred to as doing it in zero-shot, that you're not giving it any correct examples of how to translate. But there's another interpretation that I kind of liked better, which is that pre-trained models modeling a like multiverse of fictional documents. When you prompt the model, you're sort of in a superposition of like all possible documents that might continue from this one. It has pre-trained knowledge that it's leveraging there.