

Approximations may be all you need: Towards Pre-training LLMs with Low-Rank Decomposition and Optimizers

Namrata Shivagunde ⋅ Mayank Kulkarni ⋅ Giannis Karamanolakis ⋅ Jack FitzGerald ⋅ Yannick Versley ⋅ Saleh Soltan ⋅ Volkan Cevher ⋅ Jianhua Lu ⋅ Anna Rumshisky

Abstract
