Poster

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Chang Chen ⋅ Min Li ⋅ Zhihua Wu ⋅ Dianhai Yu ⋅ Chao Yang

Abstract
