AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Tell the Difference Between Good and Bad Work
I tried to help people along the way and I don't Think they got far enough like some of them got some distance, but they they didn't turn into alignment specialists doing great work. It's it's the problem of the broken verifier If somebody had a bunch of talent in physics and were like well like I want to work in this field I might be like well, there's interpretability. But that's not the same as having the thing inside them that generalizes correctly without anybody standing over their shoulder Enforcing them to get the right answerget the right answer. The entire schooling process of like here is this legible question that you're supposed to have already been taught how to