Using Masking to Probe Pre-Trained Models | 3min snip from The Gradient: Perspectives on AI

Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

The Gradient: Perspectives on AI

NOTE

Using Masking to Probe Pre-Trained Models

A supermask is a binary mask that picks out a subnetwork of a randomly initialized network. The technique consists of finding such subnetworks that perform well on a given task without any training of the underlying weights.

The supermask approach can be used as an alternative to training neural networks: instead of updating weights, it searches over the subnetworks of a randomly initialized model.

The same masking idea can also be used to probe a pre-trained model, by identifying the subnetworks responsible for certain characteristics or for learning certain features.
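To make the "search over subnetworks" idea concrete, here is a minimal sketch in plain Python. It is not the method discussed in the episode: the toy task, the network size, and the helper names (`make_data`, `forward`, `loss`, `search_mask`) are all assumptions for illustration, and a simple bit-flip hill climb stands in for whatever mask-learning procedure is actually used in practice. The point it illustrates is only that the random weights stay frozen while the mask changes.

```python
import random

def make_data(n, rng):
    # Toy task (an assumption for illustration): target is 1 when x0 > x1, else 0.
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        data.append((x, 1.0 if x[0] > x[1] else 0.0))
    return data

def forward(x, w1, w2, m1, m2):
    # Two-layer ReLU network; every weight is multiplied by its 0/1 mask entry.
    out = 0.0
    for j in range(len(w2)):
        h = sum(w1[j][i] * m1[j][i] * x[i] for i in range(len(x)))
        out += w2[j] * m2[j] * max(h, 0.0)
    return out

def loss(data, w1, w2, m1, m2):
    # Mean squared error of the masked network on the toy data.
    return sum((forward(x, w1, w2, m1, m2) - y) ** 2 for x, y in data) / len(data)

def search_mask(data, w1, w2, steps, rng):
    # Hill climbing over binary masks: flip one bit, keep the flip only if
    # the loss does not get worse. The weights w1 and w2 are never modified.
    m1 = [[1] * len(row) for row in w1]
    m2 = [1] * len(w2)
    initial = best = loss(data, w1, w2, m1, m2)
    n_in = len(w1[0])
    for _ in range(steps):
        j = rng.randrange(len(w2))
        i = rng.randrange(n_in + 1)  # index n_in selects the output-weight mask
        if i < n_in:
            m1[j][i] ^= 1
        else:
            m2[j] ^= 1
        candidate = loss(data, w1, w2, m1, m2)
        if candidate <= best:
            best = candidate
        elif i < n_in:
            m1[j][i] ^= 1  # revert the flip
        else:
            m2[j] ^= 1  # revert the flip
    return m1, m2, initial, best

rng = random.Random(0)
hidden = 16
w1 = [[rng.gauss(0, 1) for _ in range(2)] for _ in range(hidden)]  # frozen random weights
w2 = [rng.gauss(0, 1) for _ in range(hidden)]                      # frozen random weights
w1_before = [row[:] for row in w1]
w2_before = w2[:]

data = make_data(64, rng)
m1, m2, initial_loss, best_loss = search_mask(data, w1, w2, steps=300, rng=rng)
kept = sum(sum(row) for row in m1) + sum(m2)
print("loss with all weights kept:", initial_loss)
print("loss with the found mask:  ", best_loss)
print("mask entries kept:", kept, "of", 3 * hidden)
```

Probing a pre-trained model would follow the same pattern under the same caveats: `w1` and `w2` would hold trained weights rather than random ones, and the loss would measure the characteristic or feature of interest, so the surviving mask entries point at the subnetwork that supports it.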
