Skip to yearly menu bar Skip to main content


San Diego Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #409

Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning

Yiwen Zhu · Jinyi Liu · Pengjie Gu · Yifu Yuan · Zhenxing Ge · Wenya Wei · Zhou Fang · Yujing Hu · Bo An

Abstract

Video

Chat is not available.