AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

CHAPTER

Automating 99.9% of Alignment Research

The system is adding a lot of value, but like really excelling in these tasks. And so the system doesn't need to pursue long run goals. Now is that going to be competitive with like, you know, did you enter an arrow on like, did you get the solution right in the long run? That is very unclear. But at the very least, you can use this kind of broken down step by step set up, get the system to do a lot of really useful things that humans would have done and then piece it together.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner