Skip to yearly menu bar Skip to main content


Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Tong Zheng ⋅ Hongming Zhang ⋅ Wenhao Yu ⋅ Xiaoyang Wang ⋅ He Xing ⋅ Runpeng(Leo) Dai ⋅ Rui Liu ⋅ Huiwen Bao ⋅ Chengsong Huang ⋅ Heng Huang ⋅ Dong Yu

Abstract

Chat is not available.