AXRP - the AI X-risk Research Podcast cover image

21 - Interpretability for Engineers with Stephen Casper

AXRP - the AI X-risk Research Podcast

00:00

Softmax Linear Units and the AI Safety Interpretability Community

Softmax Linear Units might be overemphasized or thought of more in isolation as a technique for avoiding disentanglement. A good handful of other techniques are kind of like not emphasized enough, he says. "It's probably pretty useful to emphasize that like lots of other similar things are going on in other places"

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app