Timezone: »
The training of neural networks by gradient descent methods is a cornerstone of the deep learning revolution. Yet, despite some recent progress, a complete theory explaining its success is still missing. This article presents, for orthogonal input vectors, a precise description of the gradient flow dynamics of training one-hidden layer ReLU neural networks for the mean squared error at small initialisation. In this setting, despite non-convexity, we show that the gradient flow converges to zero loss and characterise its implicit bias towards minimum variation norm. Furthermore, some interesting phenomena are highlighted: a quantitative description of the initial alignment phenomenon and a proof that the process follows a specific saddle to saddle dynamics.
Author Information
Etienne Boursier (EPFL)
Loucas PILLAUD-VIVIEN (INRIA)
Nicolas Flammarion (EPFL)
More from the Same Authors
-
2021 Spotlight: Decentralized Learning in Online Queuing Systems »
Flore Sentenac · Etienne Boursier · Vianney Perchet -
2021 Poster: Making the most of your day: online learning for optimal allocation of time »
Etienne Boursier · Tristan Garrec · Vianney Perchet · Marco Scarsini -
2021 Poster: Decentralized Learning in Online Queuing Systems »
Flore Sentenac · Etienne Boursier · Vianney Perchet -
2020 Poster: Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits »
Pierre Perrault · Etienne Boursier · Michal Valko · Vianney Perchet -
2020 Poster: Online Robust Regression via SGD on the l1 loss »
Scott Pesme · Nicolas Flammarion -
2020 Poster: Understanding and Improving Fast Adversarial Training »
Maksym Andriushchenko · Nicolas Flammarion -
2019 Poster: Escaping from saddle points on Riemannian manifolds »
Yue Sun · Nicolas Flammarion · Maryam Fazel -
2019 Poster: SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits »
Etienne Boursier · Vianney Perchet -
2019 Spotlight: SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits »
Etienne Boursier · Vianney Perchet