Skip to yearly menu bar Skip to main content


Poster

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Zixiang Chen ⋅ Junkai Zhang ⋅ Yiwen Kou ⋅ Xiangning Chen ⋅ Cho-Jui Hsieh ⋅ Quanquan Gu
2023 Poster

Abstract

Video

Chat is not available.