
[23] Simon Du - Gradient Descent for Non-convex Problems in Modern Machine Learning
The Thesis Review
Exploring Generalization in Convolutional Neural Networks
This chapter explores the differences between optimization and generalization in deep learning, highlighting the advantages of convolutional neural networks over fully connected networks. It compares how the two architectures perform on datasets such as CIFAR and discusses the theoretical assumptions and statistical properties that underpin the convolutional networks' advantage.