Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Language Gamification

Multi-Step Preference Optimization via Two-Player Markov Games

Yongtao Wu ⋅ Luca Viano ⋅ Yihang Chen ⋅ Zhenyu Zhu ⋅ Quanquan Gu ⋅ Volkan Cevher

Abstract

Chat is not available.