The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

57 snips
Jul 9, 2025
In this conversation with Fatih Porikli, Senior Director of Technology at Qualcomm AI Research, he unveils cutting-edge innovations from the CVPR conference. He discusses DiMA, a groundbreaking system using large language models for safe autonomous driving, dramatically reducing collision rates. Fatih also dives into SharpDepth, enhancing depth prediction through diffusion distillation. He highlights impressive on-device demos, from text-to-3D mesh generation to real-time video fabrication, showcasing the future of AI and computer vision.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

End-to-End Driving Systems Advantage

  • End-to-end autonomous driving systems optimize all components with the driving safety objective in mind.
  • This approach outperforms modular systems that individually optimize components but lack overall harmony.
INSIGHT

Semantic Interpretability in DEMA

  • End-to-end models like DEMA provide semantic explanations for driving decisions.
  • This improves interpretability by helping users understand why the vehicle acts a certain way.
INSIGHT

DEMA's State-of-the-Art Performance

  • DEMA reduces collision rates by 80% compared to state-of-the-art baselines and improves trajectory estimation accuracy by 40%.
  • It generalizes well to unseen rare scenarios, showcasing strong long-tail performance.
Get the Snipd Podcast app to discover more snips from this episode
Get the app