
43 - David Lindner on Myopic Optimization with Non-myopic Approval
AXRP - the AI X-risk Research Podcast
How MONA addresses reward hacking
David and Daniel unpack how myopic incentives prevent agents from investing in future reward-hacking strategies.
Transcript


