The Gradient: Perspectives on AI cover image

Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

The Gradient: Perspectives on AI

CHAPTER

Using Masking in Multitask Learning

The interference phenomenon is tricky, and I know a lot of people are working on that. It's kind of interesting, though, because you can think of that multitask learning or having a single multitask network as like architectically just a picture of you've got this backbone,. You get your representation, and then you have a couple of task-specific heads. But then in this case, now you've got sort of a full perhaps end-to-end network or something like that. And then the image you presented at inference time is really just these masks you're now applying to the network instead of something that's kind of locked on to it.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner