Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning

Xuanli He ⋅ Iman Keivanloo ⋅ Yi Xu ⋅ Xiang He ⋅ Belinda Zeng ⋅ Santosh Rajagopalan ⋅ Trishul Chilimbi

Abstract
