Skip to yearly menu bar Skip to main content


Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients

Cheng Ge ⋅ Heqi Yin ⋅ Hao Liang ⋅ Jiawei Zhang

Abstract

Chat is not available.