Skip to yearly menu bar Skip to main content


San Diego Poster Thu, Dec 4, 2025 • 11:00 AM – 2:00 PM PST Exhibit Hall C,D,E #5308

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Yonggan Fu · Xin Dong · Shizhe Diao · Matthijs Van keirsbilck · Hanrong Ye · Wonmin Byeon · Yashaswi Karnati · Lucas Liebenwein · Maksim Khadkevich · Alexander Keller · Jan Kautz · Yingyan (Celine) Lin · Pavlo Molchanov

Abstract

Log in and register to view live content