How to Train a Mind Tester to Punch Trees

The project is not quite there yet. And sort of like we're going on, you know, kind of intermediate tangents. So I believe like in the first blog post, what we mainly outlined was a basic PPO policy to kind of punch trees. Can we do some basicinterpretability on that? What does interpretability on an RL policy look like? What does it feel like? Do we kind of bump into problems that we don't really bump into in kind of other kinds of environments or training settings?"

Play episode from 01:11:39

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app