firstbacksecondback
53 Results
Poster
|
Fri 11:00 |
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts Sukwon Yun · Inyoung Choi · Jie Peng · Yangfan Wu · Jingxuan Bao · Qiyiwen Zhang · Jiayi Xin · Qi Long · Tianlong Chen |
|
Expo Demonstration
|
Tue 15:00 |
Deploying Cached Conditional Mixture-of-Experts LLMs on Mobile Devices with Memory Constraints Ron Tindall |
|
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Thérien · Stephen Rawls · Sambit Sahu · Supriyo Chakraborty · Tom Goldstein |
||
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Thérien · Stephen Rawls · Sambit Sahu · Supriyo Chakraborty · Tom Goldstein |
||
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Therien · Sambit Sahu · Stephen Rawls · Supriyo Chakraborty · Tom Goldstein |
||
Poster
|
Fri 16:30 |
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion Filip Szatkowski · Bartosz Wójcik · Mikołaj Piórczyński · Simone Scardapane |
|
Workshop
|
Understanding Compute-Parameter Trade-offs in Sparse Mixture-of-Expert Language Models Harshay Shah · Vimal Thilak · Dan Busbridge · Alaaeldin El-Nouby · Joshua Susskind · Samira Abnar |
||
Poster
|
Fri 16:30 |
Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts Hang Guo · Tao Dai · Yuanchao Bai · Bin Chen · Xudong Ren · Zexuan Zhu · Shu-Tao Xia |
|
Poster
|
Fri 16:30 |
MoEUT: Mixture-of-Experts Universal Transformers Robert Csordas · Kazuki Irie · Jürgen Schmidhuber · Christopher Potts · Christopher D Manning |
|
Poster
|
Wed 11:00 |
Multi-Head Mixture-of-Experts Xun Wu · Shaohan Huang · Wenhui Wang · Shuming Ma · Li Dong · Furu Wei |
|
Poster
|
Thu 11:00 |
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention Robert Csordas · Piotr Piękos · Kazuki Irie · Jürgen Schmidhuber |
|
Workshop
|
Tabby: Tabular Adaptation for Language Models Sonia Cromp · Satya Sai Srinath Namburi · Catherine Cao · Mohammed Alkhudhayri · Samuel Guo · Nicholas Roberts · Frederic Sala |