8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

Communication Equilibrium

There are different types of... that is an example of what I would call a communication equilibrium, where you are encoding your preferences into a language or into symbols. And if those are mismatched, you have no bounds on the performance of the resulting robot policy. In the imitation-learning-style version of this problem, the person just says, like, "OK, let me figure out how to solve the maze. Do my best." The robot will try to copy that. If we're doing IRL with the right kind of prior and things like that, the robot can actually learn and perhaps improve on the person's performance.

Play episode from 48:38