Skip to yearly menu bar Skip to main content


RLVR vs. Distillation: Understanding Accuracy and Capability in LLM Mathematical Reasoning

Minwu Kim ⋅ Anubhav Shrestha ⋅ Safal Shrestha ⋅ Aadim Nepal ⋅ Keith Ross

Abstract

Chat is not available.