Poster

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

Shuaipeng Li ⋅ Penghao Zhao ⋅ Hailin Zhang ⋅ Xingwu Sun ⋅ Hao Wu ⋅ Dian Jiao ⋅ Weiyan Wang ⋅ Chengjun Liu ⋅ Zheng Fang ⋅ Jinbao Xue ⋅ Yangyu Tao ⋅ Bin CUI ⋅ Di Wang
2024 Poster

Abstract
