Skip to yearly menu bar Skip to main content


Spotlight Poster

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

Wenyu Du ⋅ Tongxu Luo ⋅ Zihan Qiu ⋅ Zeyu Huang ⋅ Yikang Shen ⋅ Reynold Cheng ⋅ Yike Guo ⋅ Jie Fu
2024 Spotlight Poster

Abstract

Video

Chat is not available.