Skip to yearly menu bar Skip to main content


Oral
in
Workshop: Language Gamification

Multi-Step Preference Optimization via Two-Player Markov Games

Yongtao Wu ⋅ Luca Viano ⋅ Yihang Chen ⋅ Zhenyu Zhu ⋅ Quanquan Gu ⋅ Volkan Cevher
2024 Oral
in
Workshop: Language Gamification

Abstract

Video

Chat is not available.