AXRP - the AI X-risk Research Podcast cover image

21 - Interpretability for Engineers with Stephen Casper

AXRP - the AI X-risk Research Podcast

00:00

How to Evaluate the Input Synthesis Methods

Can you share the data sets produced by the code? Like rather than the code itself? Yeah, that sounds like a pretty easy thing to do. You know, just like producing a bunch of examples of like, in particular, patch and style and natural feature images that were relabeled as part of these data poisoning. But let me put that on a list. I'll work on this if I can, and I will especially work on this or someone explicitly asked me to. And it sounds like maybe you are mostly in, I guess, in podcast format. So I guess another question is the way you evaluated the input synthesis methods,. Which was essentially like you ran a survey, right

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app