Skip to yearly menu bar Skip to main content


Nano: Nested Human-in-the-Loop Reward Learning for Controlling Distribution of Generated Text

Xiang Fan · · Paul Pu Liang · Ruslan Salakhutdinov · Louis-Philippe Morency
[ Poster

Abstract

Chat is not available.