Varying optimization horizons in gridworld

They vary planning horizons from 1 to full and show a phase transition where longer horizons enable bad hacks.

Play episode from 01:21:17

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!