96. Jan Leike - AI alignment at OpenAI

Towards Data Science

Recursive Reward Modelling

The general idea of recursive reward modelling is that we train machine learning models to help us evaluate the task. So if it's an easy enough task, where we just know how to do that task, you just train it. So, for example, one of the evaluation tasks is the task of answering questions about the book. And so what you do is you just take that task, you train a separate model, and you say, look, here I'm just training a model to get really good at answering questions about longer pieces of text. The aim with recursive reward modelling was to build some kind of hypotheses on which
