

Episode 11: Vincent Sitzmann, MIT, on neural scene representations for computer vision and more general AI
May 20, 2021
Vincent Sitzmann, a postdoc at MIT, specializes in neural scene representations for computer vision. He discusses the crucial shift from 2D to 3D representations in AI, emphasizing how our understanding of vision should mirror the 3D nature of the world. Topics include the complexities of neural networks, the relationship between human perception and AI, and advancements in training techniques like self-supervised learning. Sitzmann also explores innovative applications of implicit representations and shares insights on effective research strategies for budding scientists.
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7
Intro
00:00 • 4min
Exploring Vision: From 2D to 3D Representations in AI
03:41 • 3min
Neural Representations in Visual Processing
06:59 • 28min
Navigating Perception and Neural Networks
35:06 • 7min
Exploring 3D Representations in Computer Vision
42:11 • 20min
Exploring Practical Applications of Implicit Representations
01:02:40 • 2min
Navigating Research: Strategies and Mindsets
01:04:33 • 6min