
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
Communication Equilibrium
There are different types of what that that is an example of what i would call a communication equilibrium, where you are in coding your preferences into a language or into symbols. And if those are mismatched, you have no bounds on the performance of the resulting robot policy. In imitation learning style version of this problem, person just says like, ok, let me figure how to solve the maze. Do my best. The robot will try to copy that. If we're doing iorel with the right kind of prior in things like that, the robot can actually learn and perhaps improve on the person's performance.
Play episode from 48:38
Transcript


