Jeremie Harris: Realistic Alignment and AI Policy

The Gradient: Perspectives on AI

CHAPTER

How to Maximize Your Debate

The key to me is, I don't expect the system to care about anything other than its objective. A priori, it's very difficult to claim that we could expect this thing to learn to care about whatever we tried to get it to care about. That's the idea of orthogonality: goals can be totally decorrelated from intelligence. It's not enough that it understands you want it to make paper clips. Great. That doesn't mean it's going to ignore its implicit incentives to seek power, because those incentives put it in an adversarial relationship relative to humans.
