Skip to yearly menu bar Skip to main content


(23 events)   Timezone:  
Show all
Toggle Poster Visibility
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #165
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Rohith Kuditipudi · Xiang Wang · Holden Lee · Yi Zhang · Zhiyuan Li · Wei Hu · Rong Ge · Sanjeev Arora
[ Paper [ Slides
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #166
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models
Yunfei Teng · Wenbo Gao · François Chalus · Anna Choromanska · Donald Goldfarb · Adrian Weller
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #167
Learning Neural Networks with Adaptive Regularization
Han Zhao · Yao-Hung Hubert Tsai · Russ Salakhutdinov · Geoffrey Gordon
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #168
Memory Efficient Adaptive Optimization
Rohan Anil · Vineet Gupta · Tomer Koren · Yoram Singer
[ Paper [ Slides
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #169
On the Convergence Rate of Training Recurrent Neural Networks
Zeyuan Allen-Zhu · Yuanzhi Li · Zhao Song
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #170
SGD on Neural Networks Learns Functions of Increasing Complexity
Dimitris Kalimeris · Gal Kaplun · Preetum Nakkiran · Benjamin Edelman · Tristan Yang · Boaz Barak · Haofeng Zhang
[ Paper [ Poster [ Slides
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #171
Towards Understanding the Importance of Shortcut Connections in Residual Networks
Tianyi Liu · Minshuo Chen · Mo Zhou · Simon Du · Enlu Zhou · Tuo Zhao
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #172
Trivializations for Gradient-Based Optimization on Manifolds
Mario Lezcano Casado
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #173
Using Statistics to Automate Stochastic Optimization
Hunter Lang · Lin Xiao · Pengchuan Zhang
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #174
Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Guodong Zhang · Lala Li · Zachary Nado · James Martens · Sushant Sachdeva · George Dahl · Chris Shallue · Roger Grosse
[ Paper [ Poster
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #175
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent
Jaehoon Lee · Lechao Xiao · Samuel Schoenholz · Yasaman Bahri · Roman Novak · Jascha Sohl-Dickstein · Jeffrey Pennington
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #195
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
Spencer Frei · Yuan Cao · Quanquan Gu
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #196
Are deep ResNets provably better than linear predictors?
Chulhee Yun · Suvrit Sra · Ali Jadbabaie
[ Paper [ Slides
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #197
Efficient Rematerialization for Deep Networks
Ravi Kumar · Manish Purohit · Zoya Svitkina · Erik Vee · Joshua Wang
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #198
Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks
Guodong Zhang · James Martens · Roger Grosse
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #199
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit · Víctor Campos · Yoshua Bengio
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #200
Lookahead Optimizer: k steps forward, 1 step back
Michael Zhang · James Lucas · Jimmy Ba · Geoffrey E Hinton
[ Paper [ Slides
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #201
Global Convergence of Gradient Descent for Deep Linear Residual Networks
Lei Wu · Qingcan Wang · Chao Ma
[ Paper [ Slides
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #202
Piecewise Strong Convexity of Neural Networks
Tristan Milne
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #203
PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization
Thijs Vogels · Sai Praneeth Karimireddy · Martin Jaggi
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #204
A Primal Dual Formulation For Deep Learning With Constraints
Yatin Nandwani · Abhishek Pathak · Mausam · Parag Singla
[ Paper [ Slides
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #205
Surfing: Iterative Optimization Over Incrementally Trained Deep Networks
Ganlin Song · Zhou Fan · John Lafferty
[ Paper [ Poster
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #206
Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning
Igor Colin · Ludovic DOS SANTOS · Kevin Scaman
[ Paper [ Slides