
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Importance of Uncertainty in Imitating Learning
If the uncertainty of the imitation learner surpasses the extent of the quantization, you're not really quantilizing anymore. You're really just optimizing. The thought is maybe it's actually quite a delicate balance to interpolate in just the right way to get something which is both safe and actually good.
Play episode from 02:18:19
Transcript


