The Easy Goal Inference Problem Is Still Hard

AI Safety Fundamentals: Alignment

The Importance of Inverse Reinforcement Learning

Inverse reinforcement learning is a powerful approach to AI. But in the long term, many important applications require AIs to make decisions that are better than those of available human experts. In this context, we can't use the normal paradigm, in which more accurate models are better: a perfectly accurate model would take us exactly to human mimicry and no farther. The possible extra oomph of inverse reinforcement learning comes from an explicit model of the human's mistakes or bounded rationality. This research can help identify other practical approaches to AI control that can then be explored empirically.
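
To make the contrast concrete, here is a minimal Python sketch of the argument, not anything taken from the episode. It assumes a toy setting with three actions, made-up reward values, and a human who chooses noisily according to a Boltzmann (softmax) rule with a known rationality parameter `beta`. All names and numbers are hypothetical. A perfectly accurate predictive model reproduces the human's choice probabilities, mistakes included, while inverting the assumed mistake model recovers the reward ordering and lets the agent pick the best action every time.

```python
import math


def boltzmann_policy(rewards, beta):
    """Choice probabilities of a noisily rational agent: P(a) proportional to exp(beta * r[a])."""
    weights = [math.exp(beta * r) for r in rewards]
    total = sum(weights)
    return [w / total for w in weights]


def expected_reward(policy, rewards):
    """Average reward obtained by sampling actions from the given policy."""
    return sum(p * r for p, r in zip(policy, rewards))


def infer_rewards(policy, beta):
    """Invert the assumed mistake model; rewards are recovered only up to an additive constant."""
    return [math.log(p) / beta for p in policy]


# Hypothetical true rewards, hidden from the learner.
true_rewards = [1.0, 2.0, 3.0]
# Assumption: the human's rationality parameter is known exactly.
beta = 1.0

# A perfectly accurate predictive model of the human is just the human's own policy.
human_policy = boltzmann_policy(true_rewards, beta)
mimic_value = expected_reward(human_policy, true_rewards)

# With an explicit mistake model, infer rewards and act greedily on them.
inferred = infer_rewards(human_policy, beta)
best_action = max(range(len(inferred)), key=lambda a: inferred[a])
improved_value = true_rewards[best_action]

print(f"mimicry value:  {mimic_value:.3f}")
print(f"improved value: {improved_value:.3f}")
```

The sketch hands the learner the correct mistake model for free. In practice, specifying that model is the hard part, which is the difficulty the episode title points at.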
