Skip to yearly menu bar Skip to main content


San Diego Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #409

Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning

Yiwen Zhu · Jinyi Liu · Pengjie Gu · Yifu Yuan · Zhenxing Ge · Wenya Wei · Zhou Fang · Yujing Hu · Bo An

Abstract

Log in and register to view live content