Search All 2024 Events
 

26 Results

Workshop
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Kairong Luo · Haodong Wen · Shengding Hu · Zhenbo Sun · Zhiyuan Liu · Maosong Sun · Kaifeng Lyu · Wenguang Chen
Poster
Wed 16:30 D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
Haoran Que · Jiaheng Liu · Ge Zhang · Chenchen Zhang · Xingwei Qu · Yinghao Ma · Feiyu Duan · Zhiqi Bai · Jiakai Wang · Yuanxing Zhang · Xu Tan · Jie Fu · Jiamang Wang · Lin Qu · Wenbo Su · Bo Zheng