

Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints

Chaoqi Wang · Yibo Jiang · Chenghao Yang · Han Liu · Yuxin Chen

Abstract
