Skip to yearly menu bar Skip to main content


Poster

First SFT, Second RL, Third UPT: Continual Improving Multi-Modal LLM Reasoning via Unsupervised Post-Training

Lai Wei ⋅ Yuting Li ⋅ Chen Wang ⋅ Yue Wang ⋅ Linghe Kong ⋅ Weiran Huang ⋅ Lichao Sun
2025 Poster

Abstract

Video

Chat is not available.