Skip to yearly menu bar Skip to main content


Poster

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Zhongwei Wan ⋅ Zhihao Dou ⋅ Che Liu ⋅ Yu Zhang ⋅ Dongfei Cui ⋅ Qinjian Zhao ⋅ Hui Shen ⋅ Jing Xiong ⋅ Yi Xin ⋅ Yifan Jiang ⋅ Chaofan Tao ⋅ Yangfan He ⋅ Mi Zhang ⋅ Shen Yan
2025 Poster

Abstract

Video

Chat is not available.