AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

How to Make AI Systems More Aligned

There's a lot of important open problems here that leads to be like we need a lot of progress on, says Jeff Smith. There are questions around how do we make today's models more aligned right like solve hallucinations and jailbreaking try to improve monitoring. The AI ethics community is really interested in of like can we get the systems to be less biased and underrepresented groups of viewers and so on. I think there's actually like a lot more scope for theory work than people are currently doing but theoretical work is generally hard because it requires assumptions that don't hold true.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app