
96. Jan Leike - AI alignment at OpenAI

Towards Data Science


Recursive Reward Modelling

The general idea of recursive reward modelling is that we train machine learning models to help us evaluate the task. So if it's an easy enough task, we just know how to do that task, and you can just train on it. So, for example, one of the evaluation subtasks is the task of answering questions about the book. And so what you do is you just take that task, you train a separate model, and you're like, look, here, now I'm just training a model to get really good at answering questions about longer pieces of text. The aim with recursive reward modelling was to build some kind of hypotheses on which
