Skip to yearly menu bar Skip to main content


Spotlight Poster

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Zeqiu Wu ⋅ Yushi Hu ⋅ Weijia Shi ⋅ Nouha Dziri ⋅ Alane Suhr ⋅ Prithviraj Ammanabrolu ⋅ Noah Smith ⋅ Mari Ostendorf ⋅ Hannaneh Hajishirzi
2023 Spotlight Poster

Abstract

Video

Chat is not available.