AXRP - the AI X-risk Research Podcast cover image

12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

00:00

How Does It Help in Outer Linear Type Settings?

We're concerned about models which like, sort of ar deliberately obfiscating what's happening on camera. And how does that help in these, like, outer linean type settings? Yes, i think the biggest thing is that, like imaging your model again, which is predicting s from the future. I am very interested in techniques where, ai systems are helping humans do the evaluation. So you kind of imagine thes slight, gradual, inductive process where as your ai gets better, they help the humans answer harderand harder questions to allow the data to get ever better. Meiferent versions of dunchoo and Eertz both use eyes to traine eyes

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app