Skip to yearly menu bar Skip to main content


DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

Qi Cao ⋅ Ruiyi Wang ⋅ Ruiyi Zhang ⋅ Sai Ashish Somayajula ⋅ Pengtao Xie

Abstract

Chat is not available.