Skip to yearly menu bar Skip to main content


Poster

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Chang Chen · Min Li · Zhihua Wu · Dianhai Yu · Chao Yang

Abstract

Video

Chat is not available.