AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

How the Super Alignment Team Is Relating to Other Things at Open EI Like Efforts to Make Chat GPT Nicer Minimize on Our Sources

The super alignment team at open EI is working on ways to make chat GPT nicer minimize on our sources or something like that. There's a lot of things that you could mean by AI governance and one thing they're working on is how do we evaluate the models dangerous capabilities? The feedback mechanisms where like can we help make alignment of GPT-5 better with our techniques, he says.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app