Skip to yearly menu bar Skip to main content


Compositional preference models for alignment with scalable oversight

Dongyoung Go ⋅ Tomasz Korbak ⋅ Germán Kruszewski ⋅ Jos Rozen ⋅ Marc Dymetman

Abstract

Chat is not available.