[19] Dumitru Erhan - Understanding Deep Architectures and the Effect of Unsupervised Pretraining

Feb 19, 2021

Dumitru Erhan, a Research Scientist at Google Brain, dives into the fascinating world of neural networks. He discusses his groundbreaking PhD work on deep architectures and unsupervised pretraining. The conversation touches on the evolution of deep learning, the significance of regularization hypotheses, and the philosophical nuances in AI task conceptualization. Dumitru shares insights into the transition from traditional computer vision to deep neural networks and highlights the importance of unexpected outcomes in enhancing research understanding.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

ANECDOTE

Accidental Entry to ML

Dumitru Erhan's entry into machine learning was accidental, starting with an internship in Helsinki, Finland.
Despite no prior knowledge, he found it intriguing and pursued it further at his university with Herbert Jaeger.

ANECDOTE

A Risky PhD Topic

Yoshua Bengio initially hesitated to take Dumitru as a PhD student due to deep learning's uncertainty in 2006.
Dumitru accepted the risk, recognizing the field's exciting potential.

INSIGHT

Exploring the Unknown

The allure of exploring the unknown in deep learning excited Dumitru.
The techniques worked, but the underlying reasons were unclear, creating a fertile ground for research.

Get the Snipd Podcast app to discover more snips from this episode

Get the app