Alignment Fundamentals and Interpretability

"We have never shared the world with something that was as powerful as we are but had a very different way of functioning, for which we had no internal models. We had no way of predicting what it was going to do. This is different in an important dimension," he says. "I think people are thinking of this just as a technology, and instead they should be thinking of it as a new kind of intelligence."
