

Poster in Workshop: Workshop on Multi-Turn Interactions in Large Language Models
Sat, Dec 6, 2025 • 3:45 PM – 4:45 PM PST

$\mathbf{T^3}$: Reducing Belief Deviation in Reinforcement Learning for Active Reasoning

Deyu Zou ⋅ Yongqiang Chen ⋅ Jianxiang Wang ⋅ Garry YANG ⋅ Mufei Li ⋅ Qing Da ⋅ Pan Li ⋅ Yu Gong ⋅ James Cheng

Abstract
