How to Train a Language Model
I would love it if you could go over the different steps, from the self-supervised part to the fine-tuning to the reinforcement learning with human feedback. How would you explain all those quite complicated steps in simple words?

Yeah. So, how training works. One of the things that makes these models work now is that we can take a lot of unlabeled data and train the model on it. And so you can see that we can just slide this window and create millions or billions of training examples. This is why they're called language models. Now, that turned out to be one of the most magical things, one of the biggest returns on investment that maybe the technology system was like if human technology