
Ep#12 VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
RoboPapers
00:00
Exploring Spatio-Temporal Learnable Embeddings in Autonomous Vehicles
This chapter delves into spatio-temporal learnable embeddings in autonomous driving models, distinguishing between spatial and temporal components. It highlights initial findings that show the model's ability to cluster objects like pedestrians and road markings, revealing its potential for recognizing patterns in driving scenarios without supervision.
Transcript
Play full episode