Skip to yearly menu bar Skip to main content


Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints

Chaoqi Wang ⋅ Yibo Jiang ⋅ Chenghao Yang ⋅ Han Liu ⋅ Yuxin Chen

Abstract

Chat is not available.