Skip to yearly menu bar Skip to main content


Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Mengzhou Xia ⋅ Tianyu Gao ⋅ Zhiyuan Zeng ⋅ Danqi Chen

Abstract

Video

Chat is not available.