

Nano: Nested Human-in-the-Loop Reward Learning for Controlling Distribution of Generated Text

Xiang Fan ⋅ ⋅ Paul Pu Liang ⋅ Ruslan Salakhutdinov ⋅ Louis-Philippe Morency
Poster

Abstract
