Skip to yearly menu bar Skip to main content


Poster Thu, Dec 4, 2025 • 11:00 AM – 2:00 PM PST

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Yonggan Fu ⋅ Xin Dong ⋅ Shizhe Diao ⋅ Matthijs Van keirsbilck ⋅ Hanrong Ye ⋅ Wonmin Byeon ⋅ Yashaswi Karnati ⋅ Lucas Liebenwein ⋅ Maksim Khadkevich ⋅ Alexander Keller ⋅ Jan Kautz ⋅ Yingyan (Celine) Lin ⋅ Pavlo Molchanov

Abstract

Video

Chat is not available.