24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

CHAPTER

The Role of Humans in AI Alignment

I would be excited to be replaced by AI. I think humans should always stay in the loop somehow. There's this quote from "Planning for AGI and Beyond" that says it's possible that AGI capable enough to accelerate its own progress could cause major changes to happen surprisingly quickly. And then it says: we think a slower takeoff is easier to make safe. So one thing I wonder is: if we make this really smart, you know, human-level alignment researcher that we then effectively 10x or 100x or something, does that end up playing into this recursive self-improvement loop? You can't have that recursion without also improving your alignment a lot. There's just no way that that…
