Timezone: »
Poster
Accelerated Gradient Methods for Stochastic Optimization and Online Learning
Chonghai Hu · James Kwok · Weike Pan
Regularized risk minimization often involves non-smooth optimization, either because of the loss function (e.g., hinge loss) or the regularizer (e.g., $\ell_1$-regularizer). Gradient descent methods, though highly scalable and easy to implement, are known to converge slowly on these problems. In this paper, we develop novel accelerated gradient methods for stochastic optimization while still preserving their computational simplicity and scalability. The proposed algorithm, called SAGE (Stochastic Accelerated GradiEnt), exhibits fast convergence rates on stochastic optimization with both convex and strongly convex objectives. Experimental results show that SAGE is faster than recent (sub)gradient methods including FOLOS, SMIDAS and SCD. Moreover, SAGE can also be extended for online learning, resulting in a simple but powerful algorithm.
Author Information
Chonghai Hu (Zhejiang University)
James Kwok (Hong Kong University of Science and Technology)
Weike Pan (Hong Kong UST)
More from the Same Authors
-
2021 Spotlight: TOHAN: A One-step Approach towards Few-shot Hypothesis Adaptation »
Haoang Chi · Feng Liu · Wenjing Yang · Long Lan · Tongliang Liu · Bo Han · William Cheung · James Kwok -
2023 Poster: Efficient Hyper-parameter Optimization with Cubic Regularization »
Zhenqian Shen · Hansi Yang · Yong Li · James Kwok · Quanming Yao -
2023 Poster: Nonparametric Teaching for Multiple Learners »
Chen Zhang · Xiaofeng Cao · Weiyang Liu · Ivor Tsang · James Kwok -
2022 Poster: Multi-Objective Deep Learning with Adaptive Reference Vectors »
Weiyu Chen · James Kwok -
2021 Poster: Effective Meta-Regularization by Kernelized Proximal Regularization »
Weisen Jiang · James Kwok · Yu Zhang -
2021 Poster: TOHAN: A One-step Approach towards Few-shot Hypothesis Adaptation »
Haoang Chi · Feng Liu · Wenjing Yang · Long Lan · Tongliang Liu · Bo Han · William Cheung · James Kwok -
2020 Poster: Timeseries Anomaly Detection using Temporal Hierarchical One-Class Network »
Lifeng Shen · Zhuocong Li · James Kwok -
2020 Poster: Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS »
Han Shi · Renjie Pi · Hang Xu · Zhenguo Li · James Kwok · Tong Zhang -
2019 Poster: Communication-Efficient Distributed Blockwise Momentum SGD with Error-Feedback »
Shuai Zheng · Ziyue Huang · James Kwok -
2019 Poster: Normalization Helps Training of Quantized LSTM »
Lu Hou · Jinhua Zhu · James Kwok · Fei Gao · Tao Qin · Tie-Yan Liu -
2018 Poster: Scalable Robust Matrix Factorization with Nonconvex Loss »
Quanming Yao · James Kwok -
2015 Poster: Fast Second Order Stochastic Backpropagation for Variational Inference »
Kai Fan · Ziteng Wang · Jeff Beck · James Kwok · Katherine Heller -
2012 Poster: Mandatory Leaf Node Prediction in Hierarchical Multilabel Classification »
Wei Bi · James Kwok