Poster

Compressing Large Language Models using Low Rank and Low Precision Decomposition

Rajarshi Saha ⋅ Naomi Sagan ⋅ Varun Srivastava ⋅ Andrea Goldsmith ⋅ Mert Pilanci
2024 Poster

Abstract
