Skip to yearly menu bar Skip to main content


Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency

Hongyu Li ⋅ Songhao Han ⋅ Yue Liao ⋅ Junfeng Luo ⋅ Jialin Gao ⋅ Shuicheng Yan ⋅ Si Liu

Abstract

Video

Chat is not available.