Skip to yearly menu bar Skip to main content


Oral
in
Workshop: Language Gamification

Multi-Step Preference Optimization via Two-Player Markov Games

Yongtao Wu · Luca Viano · Yihang Chen · Zhenyu Zhu · Quanquan Gu · Volkan Cevher
2024 Oral
in
Workshop: Language Gamification

Abstract

Video

Chat is not available.