Training and inference of large language models using 8-bit floating point

Sergio Perez ⋅ Yan Zhang ⋅ James Briggs ⋅ Charles Blake ⋅ Josh Levy-Kramer ⋅ Paul Balanca ⋅ Carlo Luschi ⋅ Stephen Barlow ⋅ Andrew Fitzgibbon

Abstract

