Skip to yearly menu bar Skip to main content


Spotlight Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

DAPO : Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage-Based Policy Optimization

Jiacai Liu ⋅ Chaojie Wang ⋅ Chris Liu ⋅ Liang Zeng ⋅ Rui Yan ⋅ Yiwen Sun ⋅ Yang Liu

Abstract

Video

Chat is not available.