
43 - David Lindner on Myopic Optimization with Non-myopic Approval
AXRP - the AI X-risk Research Podcast
00:00
Overview of MONA
David Lindner explains MONA: myopic optimization plus non-myopic human approval to limit multi-step reward hacking.
Play episode from 00:29
Transcript


