AXRP - the AI X-risk Research Podcast cover image

24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

00:00

The Importance of Cross Validation Techniques

I think the best case would be that they actually complement more than they do the same thing. If you can understand or like you can improve how the models are generalizing then then it gives you a way to leverage the models internals for whatever you're trying to do. You really like training a model to like tell you what you want to hear like what you believe but it might not be what the model believes. I don't know if there's this hypothesis that pre-trained language models are just like ensembles of different personas right and you might extract the beliefs of one into another.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app