Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

Yuqing Wang ⋅ Zhenghao Xu ⋅ Tuo Zhao ⋅ Molei Tao

Abstract
