24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

How to Make an Automated Alignment Researcher

I think AI is, on average, more creative than humans. And then in terms of long-run goals, I think this is actually not needed at all. We can hand off pretty small, well-scoped tasks to AI systems, and if they really nailed those, it would be really useful. That could be things like: here's the paper that we just wrote, please suggest some next steps or some new experiments to do. If you imagine having a real star researcher that you can ask these questions, they only have to optimize over the next few thousand tokens. And if they do that super well, then you would get a lot of value from them.
