Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Tong Zheng · Hongming Zhang · Wenhao Yu · Xiaoyang Wang · He Xing · Runpeng(Leo) Dai · Rui Liu · Huiwen Bao · Chengsong Huang · Heng Huang · Dong Yu

Abstract
