Is the Schedule Learning?

The schedule is kind of a nice property that makes sense, rather than being specifically what you're optimizing for. It's more something that emerges during training. PBT will always pick the right learning rate for the next chunk of training steps, but not necessarily the one that is best again in five chunks' time. Another class of methods for solving the same online hyperparameter problem in RL is meta-gradients. There's this paper on bootstrapped meta-gradients, which is a way to slightly incorporate the future of your training in your updates.
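To make the greedy behaviour described above concrete, here is a minimal sketch of a PBT-style loop on a toy problem. Everything in it is an assumption chosen for illustration and does not come from the episode: the quadratic loss, the noisy gradient, the population size, the "bottom half copies top half" exploit rule, the 0.8/1.2 perturbation factors, and the names `train_chunk` and `pbt`. Each exploit step only looks at the loss after the latest chunk, so the recorded schedule is simply the sequence of learning rates that happened to win, and nothing scores a learning rate for its effect several chunks later.

```python
import random


def loss(w):
    # Toy objective (assumption): a 1-D quadratic with its minimum at w = 0.
    return 0.5 * w * w


def train_chunk(w, lr, steps, rng, noise=0.5):
    # Run one chunk of noisy gradient descent with a fixed learning rate.
    for _ in range(steps):
        grad = w + rng.gauss(0.0, noise)  # noisy gradient of the toy loss
        w -= lr * grad
    return w


def pbt(pop_size=8, chunks=20, steps=10, seed=0):
    rng = random.Random(seed)
    # Each member starts from the same weight with a random learning rate.
    pop = [{"w": 5.0, "lr": 10 ** rng.uniform(-3, 0)} for _ in range(pop_size)]
    schedule = []  # learning rate of the best member after each chunk
    for _ in range(chunks):
        for member in pop:
            member["w"] = train_chunk(member["w"], member["lr"], steps, rng)
        # Greedy selection: rank only by the loss after this chunk.
        pop.sort(key=lambda m: loss(m["w"]))
        schedule.append(pop[0]["lr"])
        half = pop_size // 2
        for loser, winner in zip(pop[half:], pop[:half]):
            loser["w"] = winner["w"]                             # exploit
            loser["lr"] = winner["lr"] * rng.choice([0.8, 1.2])  # explore
    return schedule, loss(pop[0]["w"])


if __name__ == "__main__":
    schedule, final_loss = pbt()
    print("emergent learning-rate schedule:", [round(lr, 4) for lr in schedule])
    print("final loss of best member:", final_loss)
```

The `schedule` list is never optimized directly. It is a by-product of repeated short-horizon selection, which is the sense in which the schedule "emerges" rather than being learned.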

Play episode from 24:01
