Skip to yearly menu bar Skip to main content


Understanding Hidden Context in Preference Learning: Consequences for RLHF

Anand Siththaranajn · Cassidy Laidlaw · Dylan Hadfield-Menell

Abstract

Chat is not available.