Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Shenzhi Wang ⋅ Le Yu ⋅ Chang Gao ⋅ Chujie Zheng ⋅ Shixuan Liu ⋅ Rui Lu ⋅ Kai Dang ⋅ Xiong-Hui Chen ⋅ Jianxin Yang ⋅ Zhenru Zhang ⋅ Yuqiong Liu ⋅ An Yang ⋅ Andrew Zhao ⋅ Yang Yue ⋅ Shiji Song ⋅ Bowen Yu ⋅ Gao Huang ⋅ Junyang Lin

