Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding

Haoyuan Wu ⋅ Rui Ming ⋅ Jilong Gao ⋅ Hangyu Zhao ⋅ Xueyi Chen ⋅ Yikai Yang ⋅ Haisheng Zheng ⋅ Zhuolun He ⋅ Bei Yu
