
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
The Line of Work on Inverse Reward Design
Inverse reward design has been extended in a couple of directions, to look at both active learning and using it to fuse multiple reward functions into a single one. We've done some work on the mechanics of algorithms within cooperative IRL, and we actually give an efficient algorithm for computing optimal cooperative IRL strategy pairs.
Transcript


