Vincent Sitzmann, MIT: Neural scene representations for computer vision and more general AI

May 20, 2021

Vincent Sitzmann, a postdoc at MIT, specializes in neural scene representations for computer vision. He discusses the crucial shift from 2D to 3D representations in AI, emphasizing how our understanding of vision should mirror the 3D nature of the world. Topics include the complexities of neural networks, the relationship between human perception and AI, and advancements in training techniques like self-supervised learning. Sitzmann also explores innovative applications of implicit representations and shares insights on effective research strategies for budding scientists.

Ask episode

Chapters

Transcript

Episode notes

Intro

00:00 • 4min

Exploring Vision: From 2D to 3D Representations in AI

03:41 • 3min

Neural Representations in Visual Processing

06:59 • 28min

Navigating Perception and Neural Networks

35:06 • 7min

Exploring 3D Representations in Computer Vision

42:11 • 20min

Exploring Practical Applications of Implicit Representations

01:02:40 • 2min

Navigating Research: Strategies and Mindsets

01:04:33 • 6min