D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models

Yikun Jiang ⋅ Huanyu Wang ⋅ Lei Xie ⋅ Hanbin Zhao ⋅ zhang chao ⋅ Hui Qian ⋅ John C. S. Lui
2024 Poster
