Using Masking in Multitask Learning

The interference phenomenon is tricky, and I know a lot of people are working on that. It's kind of interesting, though, because you can think of that multitask learning or having a single multitask network as like architectically just a picture of you've got this backbone,. You get your representation, and then you have a couple of task-specific heads. But then in this case, now you've got sort of a full perhaps end-to-end network or something like that. And then the image you presented at inference time is really just these masks you're now applying to the network instead of something that's kind of locked on to it.

Play episode from 45:02

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app