

Poster

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Shenzhi Wang ⋅ Le Yu ⋅ Chang Gao ⋅ Chujie Zheng ⋅ Shixuan Liu ⋅ Rui Lu ⋅ Kai Dang ⋅ Xiong-Hui Chen ⋅ Jianxin Yang ⋅ Zhenru Zhang ⋅ Yuqiong Liu ⋅ An Yang ⋅ Andrew Zhao ⋅ Yang Yue ⋅ Shiji Song ⋅ Bowen Yu ⋅ Gao Huang ⋅ Junyang Lin
2025 Poster

