AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

CHAPTER

How to Make a Smart AI Alignment Researcher

You basically need two parts. One is you needed system that is smart enough to do it and then the second part is you need to align it to actually do it. And I'm personally not working on the first one, but people are working hard of making it happen. There's a lot of different ways that you go, but you could just picture like pre-training a much larger model. Eventually it will just be smart enough. On the second part, that's the part I'm really interested in. How do you get it to actual do alignment research in the way we would want it to?

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner