Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency

Hongyu Li · Songhao Han · Yue Liao · Junfeng Luo · Jialin Gao · Shuicheng Yan · Si Liu

Abstract
