San Diego Spotlight Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #111

Nemotron-CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Shizhe Diao · Yu Yang · Yonggan Fu · Xin Dong · Dan SU · Markus Kliegl · ZIJIA CHEN · Peter Belcak · Yoshi Suhara · Hongxu Yin · Mostofa Patwary · Yingyan (Celine) Lin · Jan Kautz · Pavlo Molchanov

Abstract
