
Episode 18: Oleh Rybkin, UPenn, on exploration and planning with world models
Generally Intelligent
How to Train a Dynamics Model on YouTube?
If we just train a current dynamics model on YouTube, it's going to fail dramatically. It's not going to scale. We need to design much better algorithms that can handle the data. And now we have a lot of methods for training latent variable models. So you can actually train this model even though your action is latent and [unclear]. You can train it with variational inference. There's actually a cool sort of probabilistic math for figuring out how to learn a latent action representation from passive videos that don't have any actions. Some other [unclear] at Penn, Karl Schmeckpeper, actually tested
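As a rough illustration of the idea in this snippet, the sketch below computes a variational objective (a negative ELBO) for a dynamics model whose action is an unobserved latent variable inferred from a pair of consecutive frames. It is not the method discussed in the episode: the linear encoder and decoder, the unit-variance Gaussian likelihood, the standard normal prior, and every name (`encode`, `decode`, `negative_elbo`, `init_params`, `beta`) are assumptions chosen to keep the example small. Only the objective is shown; gradient computation and the training loop are left out.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def gaussian_kl(mu, log_var):
    """KL(N(mu, exp(log_var)) || N(0, 1)), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var))


def encode(frame, next_frame, W_mu, W_lv):
    """Hypothetical inference network q(z | frame, next_frame).

    It sees both frames, so it can infer which latent action explains the change.
    """
    pair = frame + next_frame  # list concatenation
    return matvec(W_mu, pair), matvec(W_lv, pair)


def decode(frame, z, W_dec):
    """Hypothetical dynamics model: predict the next frame from frame and latent action."""
    return matvec(W_dec, frame + z)


def negative_elbo(frame, next_frame, params, rng, beta=1.0):
    """Single-sample negative ELBO for one (frame, next_frame) pair; no action labels used."""
    mu, log_var = encode(frame, next_frame, params["W_mu"], params["W_lv"])
    # Reparameterisation: z = mu + sigma * eps, with eps ~ N(0, 1).
    z = [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]
    pred = decode(frame, z, params["W_dec"])
    # Unit-variance Gaussian log-likelihood, up to an additive constant.
    recon = 0.5 * sum((p - t) ** 2 for p, t in zip(pred, next_frame))
    kl = gaussian_kl(mu, log_var)
    return recon + beta * kl, recon, kl


def init_params(frame_dim, latent_dim, rng, scale=0.1):
    """Small random linear weights (illustrative stand-ins for neural networks)."""
    def mat(rows, cols):
        return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]
    return {
        "W_mu": mat(latent_dim, 2 * frame_dim),
        "W_lv": mat(latent_dim, 2 * frame_dim),
        "W_dec": mat(frame_dim, frame_dim + latent_dim),
    }


rng = random.Random(0)
params = init_params(frame_dim=4, latent_dim=2, rng=rng)
frame = [0.0, 1.0, 0.5, -0.5]        # toy stand-in for a video frame
next_frame = [0.1, 0.9, 0.6, -0.4]   # the following frame, no action recorded
loss, recon, kl = negative_elbo(frame, next_frame, params, rng)
print(loss, recon, kl)
```

Minimising this quantity over many frame pairs would push the latent `z` to carry whatever information about the transition is not already predictable from the current frame, which is one way to read "a latent action representation" here. The KL term limits how much the latent can encode, and `beta` is an assumed knob for that trade-off.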