Skip to yearly menu bar Skip to main content


Post Training Quantization of Large Language Models with Microscaling Formats

Sayeh Sharify ⋅ Utkarsh Saxena ⋅ Zifei Xu ⋅ Wanzin Yazar ⋅ Ilya Soloveychik ⋅ Xin Wang

Abstract

Video

Chat is not available.