AXRP - the AI X-risk Research Podcast cover image

21 - Interpretability for Engineers with Stephen Casper

AXRP - the AI X-risk Research Podcast

00:00

The Hardness of Hypothesis Generation

Coming up with hypotheses seems about as hard as like program synthesis or like program translation, it's not clear to me why this is. I think what you just described is would probably be one of the best ways to try to move forward on a problem like this. And if we if we wanted to do it particularly rigorously, if we were like, you know, if we shop for the moon, we might land on the ground, because it might be very hard. Just turn a network into a piece of code that describes it.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app