Curtis Huebner on Doom, AI Timelines and Alignment at EleutherAI

The Inside View

How to Train a Minetester to Punch Trees

The project is not quite there yet, and we're sort of going on, you know, kind of intermediate tangents. So I believe in the first blog post, what we mainly outlined was a basic PPO policy to punch trees. Can we do some basic interpretability on that? What does interpretability on an RL policy look like? What does it feel like? Do we bump into problems that we don't really bump into in other kinds of environments or training settings?
