
Hyperparameter Optimization through Neural Network Partitioning with Christos Louizos - #627
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Optimizing Scheduling and Attention in Neural Networks
This chapter delves into recent innovations in scheduling algorithms for computational graphs and neural networks, highlighting two research papers with distinct methodologies. It also examines advanced techniques in transformer models, specifically a composite slice transformer, that optimize attention mechanisms and address the computational challenges typically associated with them.
Transcript
Play full episode