Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Workshop on Multi-Turn Interactions in Large Language Models
Sat, Dec 6, 2025 • 3:45 PM – 4:45 PM PST

$\mathbf{T^3}$: Reducing Belief Deviation in Reinforcement Learning for Active Reasoning

Deyu Zou · Yongqiang Chen · Jianxiang Wang · Garry YANG · Mufei Li · Qing Da · Pan Li · Yu Gong · James Cheng

Abstract

Chat is not available.