3min chapter


121. Alexei Baevski - data2vec and the future of multimodal learning

Towards Data Science

CHAPTER

Perceiver

Perceiver has an attentional mechanism to encode each type of data. So the query for each modality is modality-specific. And then you use this query, or set of queries, to pull information from the underlying sample into a fixed size [unclear]. This allows you to actually not really think about how to process the underlying data. You don't need to design a full kind of neural architecture that learns how to encode this particular type of modality into latent space. Instead, your architecture for your encoder is fixed. It's much simpler than designing a feature encoder. If we have a general-purpose algorithm, like data2vec, that
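The idea described in this chapter, modality-specific queries pulling information from a variable-length sample into a fixed-size output, can be illustrated with a minimal sketch. This is not the Perceiver implementation: the names `cross_attend`, `make_queries`, and `encode` are hypothetical, the queries are random rather than learned, there are no key/value projections, and both modalities are assumed to already share one feature dimension.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, inputs):
    """Each query attends over all input rows (keys and values are the inputs).

    The output has one row per query, regardless of how many input rows exist.
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    output = []
    for q in queries:
        scores = [scale * sum(qi * xi for qi, xi in zip(q, x)) for x in inputs]
        weights = softmax(scores)
        output.append(
            [sum(w * x[d] for w, x in zip(weights, inputs)) for d in range(dim)]
        )
    return output


def make_queries(num_latents, dim, seed):
    """Stand-in for learned queries: random vectors (an assumption)."""
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_latents)]


DIM = 8          # shared feature dimension (assumed)
NUM_LATENTS = 4  # fixed number of output rows

# One query set per modality; the attention code itself is shared.
QUERIES = {
    "audio": make_queries(NUM_LATENTS, DIM, seed=0),
    "text": make_queries(NUM_LATENTS, DIM, seed=1),
}


def encode(modality, sample):
    """Encode a sample of any length into NUM_LATENTS rows of size DIM."""
    return cross_attend(QUERIES[modality], sample)


rng = random.Random(42)
audio = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(50)]  # 50 frames
text = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(7)]    # 7 tokens

audio_latents = encode("audio", audio)
text_latents = encode("text", text)
print(len(audio_latents), len(audio_latents[0]))  # 4 8
print(len(text_latents), len(text_latents[0]))    # 4 8
```

Both samples come out as 4 rows of 8 values even though one has 50 input rows and the other 7, which is the sense in which the encoder architecture stays fixed while only the queries differ per modality.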
