The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Data Augmentation and Optimized Architectures for Computer Vision with Fatih Porikli - #635

Jun 26, 2023

Fatih Porikli, Senior Director of Technology at Qualcomm AI Research, shares insights from over 30 years in computer vision. He explores cutting-edge topics such as data augmentation techniques, optimized architectures, and advances in optical flow for video analysis. The conversation delves into the use of language models for fine-grained labeling, enhancing 3D object detection, and the role of generative AI in model efficiency. Fatih also discusses training neural networks and innovative approaches to integrating various data sources for improved accuracy.

INSIGHT

Improved Model Robustness

DistractFlow makes optical flow models more robust to visual distractions.
The gain shows up as lower endpoint error on benchmarks such as Sintel and KITTI.
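
For readers unfamiliar with the metric, endpoint error is the Euclidean distance between a predicted flow vector and the ground-truth flow vector, usually averaged over pixels. The sketch below illustrates that definition only; the function name `endpoint_error` and the `valid_mask` parameter are assumptions made for this example and are not taken from DistractFlow or from any benchmark toolkit.

```python
import numpy as np

def endpoint_error(flow_pred, flow_gt, valid_mask=None):
    """Average endpoint error between two flow fields of shape (H, W, 2).

    valid_mask (hypothetical parameter) is an optional boolean (H, W) array
    selecting the pixels that have ground truth.
    """
    flow_pred = np.asarray(flow_pred, dtype=np.float64)
    flow_gt = np.asarray(flow_gt, dtype=np.float64)
    # Per-pixel Euclidean distance between predicted and true motion vectors.
    per_pixel = np.linalg.norm(flow_pred - flow_gt, axis=-1)
    if valid_mask is not None:
        per_pixel = per_pixel[np.asarray(valid_mask, dtype=bool)]
    return float(per_pixel.mean())

# Toy 2x2 example: one pixel is off by the vector (3, 4), i.e. 5 pixels.
gt = np.zeros((2, 2, 2))
pred = np.zeros((2, 2, 2))
pred[0, 0] = [3.0, 4.0]
print(endpoint_error(pred, gt))  # 1.25 (5 pixels of error spread over 4 pixels)
```

The optional mask is included because some datasets provide ground-truth flow for only a subset of pixels, in which case the average should be taken over those pixels alone.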

INSIGHT

X3KD for 3D Object Detection

X3KD is a knowledge distillation technique for 3D object detection in autonomous driving.
It uses data from multiple cameras and LiDAR during training but only camera images at inference time.

INSIGHT

Knowledge Distillation Explained

Knowledge distillation involves training a smaller "student" network to mimic a larger, more capable "teacher" network.
The goal is to achieve comparable accuracy with a more efficient model.
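
As a rough illustration of the student-teacher idea, the sketch below implements the classic soft-target formulation for a classifier: the student is penalized both for disagreeing with the teacher's temperature-softened output distribution and for missing the true labels. This is an assumption chosen for clarity, not the loss used in X3KD or described in the episode; the names `distillation_loss`, `temperature`, and `alpha` are hypothetical.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over the last axis."""
    z = np.asarray(logits, dtype=np.float64) / temperature
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    alpha (hypothetical name) weights the teacher-matching term;
    (1 - alpha) weights ordinary cross-entropy on the true labels.
    """
    eps = 1e-12  # avoids log(0)
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions, per example.
    kl = np.sum(p_teacher * (np.log(p_teacher + eps) - np.log(p_student + eps)),
                axis=-1)
    # Scaling by T^2 keeps gradient magnitudes comparable across temperatures.
    soft_term = (temperature ** 2) * kl.mean()
    # Standard cross-entropy of the student against the ground-truth labels.
    p_hard = softmax(student_logits)
    rows = np.arange(p_hard.shape[0])
    hard_term = -np.log(p_hard[rows, labels] + eps).mean()
    return float(alpha * soft_term + (1.0 - alpha) * hard_term)

# Toy batch: 2 examples, 3 classes.
teacher = np.array([[5.0, 1.0, 0.0], [0.0, 4.0, 1.0]])
student = np.array([[2.0, 1.0, 0.5], [0.5, 1.5, 1.0]])
labels = np.array([0, 1])
print(distillation_loss(student, teacher, labels))
```

In practice the teacher's outputs are treated as fixed targets and only the student's parameters are updated; a higher temperature spreads the teacher's probability mass across classes so the student also learns which wrong answers the teacher considers plausible.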
