
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
Aligning Recommender Systems With Human Values
The value of assistance games is giving us a category to identify these types of interventions and move them from features of practice to objects of study. I think it depends on what you imagine those objects, like theta, to capture, where theta is the variable that we represent the reward function with. And so one of the assumptions in cooperative IRL that we've relaxed is this particular one, where the fact that the person has complete knowledge of the objective seems perhaps fishy.
Transcript


