Towards Data Science cover image

121. Alexei Baevski - data2vec and the future of multimodal learning

Towards Data Science

CHAPTER

Perceiverso

Perceiver has an attentional mechanism to incode each type of data. So the query for each modais modality specific. And then you use this query, or set of queries, to to pull information from the underlying sample into a fixed size and time f segments red. This allows you to actually not really think about how to process underlying data. You don't meed to design a full kind of mural architecture that learns how to incode this particular type of modality into latent space. Instead, your architecture for your incoder is fixed. It's much simpler than designing a feature in coder. If we have a general purpose altration, like data to acta that

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner