
Episode 01: Kelvin Guu, Google AI, on language models & overlooked research problems
Generally Intelligent
00:00
The Magic Behind Language Model Pre-Training
I've been quite excited about few-shot learning and the different places you can apply it. How is it that, despite many tasks not looking like language model prediction, you can still get so much generalization from that task? And there are also the well-known GPT-3 results with in-context learning. Just understanding where the model gets its bias for pattern continuation and repetition, that's been on my mind, along with learning more about causal inference.