Timezone: »

Training Very Deep Networks
Rupesh K Srivastava · Klaus Greff · Jürgen Schmidhuber

Wed Dec 09 04:00 PM -- 08:59 PM (PST) @ 210 C #4 #None

Theoretical and empirical evidence indicates that the depth of neural networks is crucial for their success. However, training becomes more difficult as depth increases, and training of very deep networks remains an open problem. Here we introduce a new architecture designed to overcome this. Our so-called highway networks allow unimpeded information flow across many layers on information highways. They are inspired by Long Short-Term Memory recurrent networks and use adaptive gating units to regulate the information flow. Even with hundreds of layers, highway networks can be trained directly through simple gradient descent. This enables the study of extremely deep and efficient architectures.

Author Information

Rupesh K Srivastava (IDSIA)
Klaus Greff (IDSIA)
Jürgen Schmidhuber (IDSIA)

Since age 15, his main goal has been to build an Artificial Intelligence smarter than himself, then retire. The Deep Learning Artificial Neural Networks developed since 1991 by his research groups have revolutionised handwriting recognition, speech recognition, machine translation, image captioning, and are now available to billions of users through Google, Microsoft, IBM, Baidu, and many other companies (DeepMind also was heavily influenced by his lab). His team's Deep Learners were the first to win object detection and image segmentation contests, and achieved the world's first superhuman visual classification results, winning nine international competitions in machine learning & pattern recognition. His formal theory of fun & creativity & curiosity explains art, science, music, and humor. He has published 333 papers, earned 7 best paper/best video awards, the 2013 Helmholtz Award of the International Neural Networks Society, and the 2016 IEEE Neural Networks Pioneer Award. He is also president of NNAISENSE, which aims at building the first practical general purpose AI.

More from the Same Authors