San Diego Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST Exhibit Hall C,D,E #4903

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Zhongwei Wan · Zhihao Dou · Che Liu · Yu Zhang · Dongfei Cui · Qinjian Zhao · Hui Shen · Jing Xiong · Yi Xin · Yifan Jiang · Chaofan Tao · Yangfan He · Mi Zhang · Shen Yan

Abstract
