
43 - David Lindner on Myopic Optimization with Non-myopic Approval
AXRP - the AI X-risk Research Podcast
00:00
Varying optimization horizons in gridworld
They vary planning horizons from 1 to full and show a phase transition where longer horizons enable bad hacks.
Play episode from 01:21:17
Transcript


