

Poster

HelpSteer 2: Open-source dataset for training top-performing reward models

Zhilin Wang ⋅ Yi Dong ⋅ Olivier Delalleau ⋅ Jiaqi Zeng ⋅ Gerald Shen ⋅ Daniel Egert ⋅ Jimmy Zhang ⋅ Makesh Narsimhan Sreedhar ⋅ Oleksii Kuchaiev
2024 Poster

Abstract
