The Inside View cover image

Curtis Huebner on Doom, AI Timelines and Alignment at EleutherAI

The Inside View

00:00

The Next Step in Model Based RL

Right now we're kind of, you know, putting aside the PPO policy for a little bit. And we're going to try and focus on like model based RL. The next step is just training models, generative models in my test. I expect to see things like tracking state. Like if you see a tree and then you turn to the left and then you turned back, presumably there's some machinery inside the generative model that is tracking the state of whether that tree is there.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app