
Episode 24: Jack Parker-Holder, DeepMind, on open-endedness, evolving agents and environments, online adaptation, and offline learning
Generally Intelligent
When you're doing this generational training and doing the architecture search, when you reset to a new generation, what you notice is that the networks get bigger over time. And we showed some examples of this in the paper. But another thing that we also see is that the value function learns a bigger network architecture than the policy. Oh yeah, that does kind of make sense. It's really interesting. So yeah, I'm trying to hopefully encourage them to keep working on it, because there's still more things to do.
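The idea described above, generational training where the sizes of the policy and value networks are themselves evolved, can be illustrated with a toy sketch. This is not the paper's method; the fitness function, mutation step, and all names below are hypothetical stand-ins, chosen so that selection pressure favors a wider value network than policy network, mirroring the observation in the conversation.

```python
import random

random.seed(0)

def fitness(policy_width, value_width):
    # Hypothetical stand-in for training an agent and measuring return:
    # here, extra value-network width helps more than extra policy width,
    # with a mild penalty on total size.
    return 2.0 * value_width + 1.0 * policy_width \
        - 0.01 * (policy_width + value_width) ** 2

def mutate(width):
    # Grow or shrink a layer width by one random step, with a floor of 8.
    return max(8, width + random.choice([-8, 0, 8]))

# Each candidate is a (policy_width, value_width) pair.
population = [(32, 32)] * 8
for generation in range(20):
    # Select the fittest half, then refill with mutated copies ("reset
    # to a new generation" in the conversation above).
    scored = sorted(population, key=lambda pv: fitness(*pv), reverse=True)
    parents = scored[: len(scored) // 2]
    population = parents + [(mutate(p), mutate(v)) for p, v in parents]

best_policy, best_value = max(population, key=lambda pv: fitness(*pv))
```

Under this (assumed) fitness, the evolved value width overtakes the policy width within a few generations, which is the qualitative pattern the speaker describes.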