V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

NOTE

Masking Semantic Regions for Learning from Videos

Masking out semantic regions in videos, meaning objects and their interactions rather than backgrounds such as sky or grass, is crucial for learning more semantic, more abstract representations. The method identifies significant events and asks the network to predict them, mirroring how human visual attention shifts over time. This matters whenever general masked modeling methods are applied to learning from visual content.
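
As a rough illustration of the idea, the sketch below builds a video mask over a grid of patches. The note does not say how semantic regions are found, so this example makes a loudly stated assumption: large contiguous blocks, repeated on every frame, stand in for "regions likely to contain objects". The function name `sample_block_tube_mask` and all of its parameters are hypothetical and are not taken from the episode or from any library.

```python
import random


def sample_block_tube_mask(num_frames, grid_h, grid_w,
                           block_h, block_w, num_blocks, seed=0):
    """Return mask[frame][row][col]; True means the patch is hidden.

    Assumption: big contiguous blocks are only a proxy for semantic
    regions. Nothing here detects objects or events.
    """
    rng = random.Random(seed)
    spatial = [[False] * grid_w for _ in range(grid_h)]
    for _ in range(num_blocks):
        # randint is inclusive on both ends, so each block stays in bounds.
        top = rng.randint(0, grid_h - block_h)
        left = rng.randint(0, grid_w - block_w)
        for r in range(top, top + block_h):
            for c in range(left, left + block_w):
                spatial[r][c] = True
    # Reuse the same spatial mask on every frame so the hidden region
    # cannot be filled in by copying pixels from a neighbouring frame.
    return [[row[:] for row in spatial] for _ in range(num_frames)]


mask = sample_block_tube_mask(num_frames=8, grid_h=14, grid_w=14,
                              block_h=6, block_w=6, num_blocks=2)
hidden_per_frame = sum(sum(row) for row in mask[0])
```

A network trained with such a mask would see only the visible patches and be asked to predict the hidden ones. Small, scattered masks tend to be solvable from local texture, which is why a large block held across time is used here as the stand-in for an object-level region.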
