AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

The Line of Work on Inverse Reward Design

Inverse reward design has been extended in a couple of directions to look at both active learning as well as a using it to fuse multiple reward functions into a single one. We've done some work on the mechanics of algaritms within co operative irel. And we actually give an efficient algorithm for computing optimal co operative ioral strategy pairs.

Play episode from 02:05:31
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app