

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738
57 snips Jul 9, 2025
In this conversation with Fatih Porikli, Senior Director of Technology at Qualcomm AI Research, he unveils cutting-edge innovations from the CVPR conference. He discusses DiMA, a groundbreaking system using large language models for safe autonomous driving, dramatically reducing collision rates. Fatih also dives into SharpDepth, enhancing depth prediction through diffusion distillation. He highlights impressive on-device demos, from text-to-3D mesh generation to real-time video fabrication, showcasing the future of AI and computer vision.
AI Snips
Chapters
Books
Transcript
Episode notes
End-to-End Driving Systems Advantage
- End-to-end autonomous driving systems optimize all components with the driving safety objective in mind.
- This approach outperforms modular systems that individually optimize components but lack overall harmony.
Semantic Interpretability in DEMA
- End-to-end models like DEMA provide semantic explanations for driving decisions.
- This improves interpretability by helping users understand why the vehicle acts a certain way.
DEMA's State-of-the-Art Performance
- DEMA reduces collision rates by 80% compared to state-of-the-art baselines and improves trajectory estimation accuracy by 40%.
- It generalizes well to unseen rare scenarios, showcasing strong long-tail performance.