firstbacksecondback
46 Results
Poster
|
Fri 16:30 |
Scaling Laws in Linear Regression: Compute, Parameters, and Data Licong Lin · Jingfeng Wu · Sham Kakade · Peter Bartlett · Jason Lee |
|
Workshop
|
Sun 16:30 |
Language model scaling laws and zero-sum learning Andrei Mircea · Ekaterina Lobacheva · Supriyo Chakraborty · Nima Chitsazan · Irina Rish |
|
Workshop
|
SepONet: Efficient Large-Scale Physics-Informed Operator Learning Xinling Yu · Sean Hooten · Ziyue Liu · Yequan Zhao · Marco Fiorentino · Thomas Van Vaerenbergh · Zheng Zhang |
||
Poster
|
Fri 11:00 |
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov · Yaswanth Chittepu · Ryan Park · Harshit Sushil Sikchi · Joey Hejna · Brad Knox · Chelsea Finn · Scott Niekum |
|
Poster
|
Wed 11:00 |
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Chaofan Tao · Qian Liu · Longxu Dou · Niklas Muennighoff · Zhongwei Wan · Ping Luo · Min Lin · Ngai Wong |
|
Poster
|
Fri 16:30 |
Scaling laws for learning with real and surrogate data Ayush Jain · Andrea Montanari · Eren Sasoglu |
|
Workshop
|
Sat 12:00 |
Skilling laws: scaling laws for LLM benchmark performance Felipe Maia Polo · Seamus Somerstep · Leshem Choshen · Yuekai Sun · Mikhail Yurochkin |
|
Poster
|
Fri 11:00 |
An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem yoonsoo nam · Nayara Fonseca · Seok Hyeong Lee · Chris Mingard · Ard Louis |
|
Workshop
|
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules Kairong Luo · Haodong Wen · Shengding Hu · Zhenbo Sun · Zhiyuan Liu · Maosong Sun · Kaifeng Lyu · Wenguang Chen |
||
Poster
|
Wed 16:30 |
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models Haoran Que · Jiaheng Liu · Ge Zhang · Chenchen Zhang · Xingwei Qu · Yinghao Ma · Feiyu Duan · ZhiqiBai zhiqi · JiakaiWang · Yuanxing Zhang · Xu Tan · Jie Fu · Jiamang Wang · Lin Qu · Wenbo Su · Bo Zheng |