AXRP - the AI X-risk Research Podcast cover image

12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

CHAPTER

How Does It Help in Outer Linear Type Settings?

We're concerned about models which like, sort of ar deliberately obfiscating what's happening on camera. And how does that help in these, like, outer linean type settings? Yes, i think the biggest thing is that, like imaging your model again, which is predicting s from the future. I am very interested in techniques where, ai systems are helping humans do the evaluation. So you kind of imagine thes slight, gradual, inductive process where as your ai gets better, they help the humans answer harderand harder questions to allow the data to get ever better. Meiferent versions of dunchoo and Eertz both use eyes to traine eyes

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner