Skip to yearly menu bar Skip to main content


Spotlight Poster

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Zeqiu Wu · Yushi Hu · Weijia Shi · Nouha Dziri · Alane Suhr · Prithviraj Ammanabrolu · Noah Smith · Mari Ostendorf · Hannaneh Hajishirzi
2023 Spotlight Poster

Abstract

Video

Chat is not available.