
The Thesis Review

Latest episodes

Jul 16, 2021 • 1h 6min

[28] Karen Ullrich - A Coding Perspective on Deep Latent Variable Models

Karen Ullrich, a Research Scientist at FAIR, studies the intersection of information theory and machine learning. She discusses her PhD work, highlighting the minimum description length principle and its impact on neural network compression. The conversation delves into the ties between data compression and cognitive processes, and explores methods for addressing imaging challenges. Ullrich also shares insights on enhancing differentiability in image reconstruction and offers practical advice for new researchers navigating complex data landscapes.
Jul 2, 2021 • 56min

[27] Danqi Chen - Neural Reading Comprehension and Beyond

Danqi Chen is an assistant professor at Princeton University, co-leading the Princeton NLP Group. Her research focuses on fundamental methods for learning representations of language and knowledge, and practical systems including question answering, information extraction and conversational agents. Danqi’s PhD thesis is titled "Neural Reading Comprehension and Beyond", which she completed in 2018 at Stanford University. We discuss her work on parsing, reading comprehension and question answering. Throughout we discuss progress in NLP, fundamental challenges, and what the future holds. Episode notes: https://cs.nyu.edu/~welleck/episode27.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
May 29, 2021 • 1h 18min

[26] Kevin Ellis - Algorithms for Learning to Induce Programs

Kevin Ellis is an assistant professor at Cornell and currently a research scientist at Common Sense Machines. His research focuses on artificial intelligence, program synthesis, and neurosymbolic models. Kevin's PhD thesis is titled "Algorithms for Learning to Induce Programs", which he completed in 2020 at MIT. We discuss Kevin’s work at the intersection of machine learning and program induction, including inferring graphics programs from images and drawings, DreamCoder, and more. Episode notes: https://cs.nyu.edu/~welleck/episode26.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
May 14, 2021 • 1h 19min

[25] Tomas Mikolov - Statistical Language Models Based on Neural Networks

Tomas Mikolov is a Senior Researcher at the Czech Institute of Informatics, Robotics, and Cybernetics. His research has covered topics in natural language understanding and representation learning, including Word2Vec and complexity. Tomas's PhD thesis is titled "Statistical Language Models Based on Neural Networks", which he completed in 2012 at the Brno University of Technology. We discuss compression and recurrent language models, the backstory behind Word2Vec, and his recent work on complexity & automata. Episode notes: https://cs.nyu.edu/~welleck/episode25.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
Apr 30, 2021 • 1h 3min

[24] Martin Arjovsky - Out of Distribution Generalization in Machine Learning

Martin Arjovsky is a postdoctoral researcher at INRIA. His research focuses on generative modeling, generalization, and exploration in RL. Martin's PhD thesis is titled "Out of Distribution Generalization in Machine Learning", which he completed in 2019 at New York University. We discuss his work on the influential Wasserstein GAN early in his PhD, then discuss his thesis work on out-of-distribution generalization which focused on causal invariance and invariant risk minimization. Episode notes: https://cs.nyu.edu/~welleck/episode24.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
Apr 16, 2021 • 1h 7min

[23] Simon Du - Gradient Descent for Non-convex Problems in Modern Machine Learning

Simon Du, an Assistant Professor at the University of Washington, delves into the theoretical foundations of deep learning and gradient descent. He discusses the intricacies of addressing non-convex problems, revealing challenges and insights from his research. The conversation highlights the significance of the neural tangent kernel and its implications for optimization and generalization. Simon also shares practical tips for reading research papers, drawing connections between theory and practice, and navigating a successful research career.
Apr 2, 2021 • 1h 3min

[22] Graham Neubig - Unsupervised Learning of Lexical Information

Graham Neubig is an Associate Professor at Carnegie Mellon University. His research focuses on language and its role in human communication, with the goal of breaking down barriers in human-human or human-machine communication through the development of NLP technologies. Graham’s PhD thesis is titled "Unsupervised Learning of Lexical Information for Language Processing Systems", which he completed in 2012 at Kyoto University. We discuss his PhD work related to the fundamental processing units that NLP systems use to process text, including non-parametric Bayesian models, segmentation, and alignment problems, and discuss how his perspective on machine translation has evolved over time. Episode notes: http://cs.nyu.edu/~welleck/episode22.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at http://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
Mar 19, 2021 • 1h 8min

[21] Michela Paganini - Machine Learning Solutions for High Energy Physics

Michela Paganini is a Research Scientist at DeepMind. Her research focuses on investigating ways to compress and scale up neural networks. Michela's PhD thesis is titled "Machine Learning Solutions for High Energy Physics", which she completed in 2019 at Yale University. We discuss her PhD work on deep learning for high energy physics, including jet tagging and fast simulation for the ATLAS experiment at the Large Hadron Collider, and the intersection of machine learning and physics. Episode notes: https://cs.nyu.edu/~welleck/episode21.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
Mar 5, 2021 • 1h 25min

[20] Josef Urban - Deductive and Inductive Reasoning in Large Libraries of Formalized Mathematics

Josef Urban is a Principal Researcher at the Czech Institute of Informatics, Robotics, and Cybernetics. His research focuses on artificial intelligence for large-scale computer-assisted reasoning. Josef's PhD thesis is titled "Exploring and Combining Deductive and Inductive Reasoning in Large Libraries of Formalized Mathematics", which he completed in 2004 at Charles University in Prague. We discuss his PhD work on the Mizar Problems for Theorem Proving, machine learning for premise selection, and how it evolved into his recent research. Episode notes: https://cs.nyu.edu/~welleck/episode20.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.patreon.com/thesisreview or www.buymeacoffee.com/thesisreview
Feb 19, 2021 • 1h 20min

[19] Dumitru Erhan - Understanding Deep Architectures and the Effect of Unsupervised Pretraining

Dumitru Erhan, a Research Scientist at Google Brain, dives into the fascinating world of neural networks. He discusses his groundbreaking PhD work on deep architectures and unsupervised pretraining. The conversation touches on the evolution of deep learning, the significance of regularization hypotheses, and the philosophical nuances in AI task conceptualization. Dumitru shares insights into the transition from traditional computer vision to deep neural networks and highlights the importance of unexpected outcomes in enhancing research understanding.
