Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

1 Results

<<   <   Page 1 of 1   >>   >
Workshop
Reward Model Underspecification in Language Model Alignment
Jacob Eisenstein · Jonathan Berant · Chirag Nagpal · Alekh Agarwal · Ahmad Beirami · Alexander D'Amour · Krishnamurthy Dvijotham · Katherine Heller · Stephen Pfohl · Deepak Ramachandran