Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism

Reza Asad ⋅ Reza Babanezhad Harikandeh ⋅ Issam Hadj Laradji ⋅ Nicolas Le Roux ⋅ Sharan Vaswani

Abstract
