AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How RLHF makes ChatGPT better
Chat GPT is a term for a model that is trained on text data and can do amazing things, but is not very useful or easy to use./nRLHF is how humans help to make the model more useful and easier to use./nLess data is required for RLHF to work well than for the original model, and the process of incorporating human feedback is very interesting.