Poster

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Jiannan Wu ⋅ Muyan Zhong ⋅ Sen Xing ⋅ Zeqiang Lai ⋅ Zhaoyang Liu ⋅ Zhe Chen ⋅ Wenhai Wang ⋅ Xizhou Zhu ⋅ Lewei Lu ⋅ Tong Lu ⋅ Ping Luo ⋅ Yu Qiao ⋅ Jifeng Dai
2024 Poster

Abstract
