How to Make an Automated Alignment Researcher

I think AI is on average more creative than humans. And then in terms of long run goals, I think this is actually not needed at all. We can hand off like pretty small well scoped tasks to AI systems that if they really nailed those, it would be really useful. That could be things like here's like the paper that we just wrote, please suggest like some next steps or like some new experiments to do. If you imagine having a really a star researcher that you can ask these questions, they only have to optimize over the next few thousand tokens. And if they do that super well, then you would get a lot of value from them.

Play episode from 12:22

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app