Skip to yearly menu bar Skip to main content


A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Kairong Luo ⋅ Haodong Wen ⋅ Shengding Hu ⋅ Zhenbo Sun ⋅ Zhiyuan Liu ⋅ Maosong Sun ⋅ Kaifeng Lyu ⋅ Wenguang Chen

Abstract

Chat is not available.