4min chapter

The Inside View cover image

Curtis Huebner on Doom, AI Timelines and Alignment at EleutherAI

The Inside View

CHAPTER

How to Train a Mind Tester to Punch Trees

The project is not quite there yet. And sort of like we're going on, you know, kind of intermediate tangents. So I believe like in the first blog post, what we mainly outlined was a basic PPO policy to kind of punch trees. Can we do some basicinterpretability on that? What does interpretability on an RL policy look like? What does it feel like? Do we kind of bump into problems that we don't really bump into in kind of other kinds of environments or training settings?"

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode