Poster

Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models

Hanxiao Zhang ⋅ Lin JU ⋅ Chan Wu ⋅ Jinjing Huang ⋅ Youshao Xiao ⋅ Zhenglei Zhou ⋅ Zhiming Fan ⋅ Zhaoxin Huan ⋅ Siyuan Li ⋅ Fanzhuang Meng ⋅ Lei Liang ⋅ Xiaolu Zhang ⋅ Jun Zhou
2024 Poster

Abstract
