24 - Superalignment with Jan Leike

AXRP - the AI X-risk Research Podcast

The Interaction of Cross Validation and Interpretation

The way you train with the generalization technique is that you just train on the easy problems, and then they turn out to generalize to the hard problems. Or, if you understand how your models generalize from easy to hard, you can make them generalize really well. You might hope you could cross-validate, for example. But with cross-validation you have to have a different split of your training set, right? So what I mean by cross-validation here is that you have one training run where you train using the generalization method, and then you validate using interpretability, scalable oversight, and other techniques.
