AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

Aligning Recommender Systems With Human Values

The value of assistance games is giving us a category to identify these types of interiventions and move them from features of practice to objects of study. A i think it depends on what you imagine those object like that fata to capture wer were data is the variable that we represent the reward function with. And so one of the assumptions in co operative iral that we've relaxed a is looking at this particular one, where the fact that the person has complete knowledge of the objective seems perhaps fishy.

Play episode from 41:07
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app