Timezone: »
We study stochastic composite mirror descent, a class of scalable algorithms able to exploit the geometry and composite structure of a problem. We consider both convex and strongly convex objectives with non-smooth loss functions, for each of which we establish high-probability convergence rates optimal up to a logarithmic factor. We apply the derived computational error bounds to study the generalization performance of multi-pass stochastic gradient descent (SGD) in a non-parametric setting. Our high-probability generalization bounds enjoy a logarithmical dependency on the number of passes provided that the step size sequence is square-summable, which improves the existing bounds in expectation with a polynomial dependency and therefore gives a strong justification on the ability of multi-pass SGD to overcome overfitting. Our analysis removes boundedness assumptions on subgradients often imposed in the literature. Numerical results are reported to support our theoretical findings.
Author Information
Yunwen Lei (Southern University of Science and Technology)
Ke Tang (Southern University of Science and Technology)
More from the Same Authors
-
2022 Spotlight: A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks »
Mingrui Liu · Zhenxun Zhuang · Yunwen Lei · Chunyang Liao -
2022 Poster: A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks »
Mingrui Liu · Zhenxun Zhuang · Yunwen Lei · Chunyang Liao -
2022 Poster: Stability and Generalization Analysis of Gradient Methods for Shallow Neural Networks »
Yunwen Lei · Rong Jin · Yiming Ying -
2022 Poster: Stability and Generalization for Markov Chain Stochastic Gradient Methods »
Puyu Wang · Yunwen Lei · Yiming Ying · Ding-Xuan Zhou -
2019 Poster: Optimal Stochastic and Online Learning with Individual Iterates »
Yunwen Lei · Peng Yang · Ke Tang · Ding-Xuan Zhou -
2019 Spotlight: Optimal Stochastic and Online Learning with Individual Iterates »
Yunwen Lei · Peng Yang · Ke Tang · Ding-Xuan Zhou -
2017 Poster: Log-normality and Skewness of Estimated State/Action Values in Reinforcement Learning »
Liangpeng Zhang · Ke Tang · Xin Yao -
2017 Poster: Subset Selection under Noise »
Chao Qian · Jing-Cheng Shi · Yang Yu · Ke Tang · Zhi-Hua Zhou -
2015 Poster: Multi-class SVMs: From Tighter Data-Dependent Generalization Bounds to Novel Algorithms »
Yunwen Lei · Urun Dogan · Alexander Binder · Marius Kloft