Skip to yearly menu bar Skip to main content


Spotlight Poster

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Chaoyou Fu ⋅ Haojia Lin ⋅ Xiong Wang ⋅ yifan zhang ⋅ Yunhang Shen ⋅ Xiaoyu Liu ⋅ Haoyu Cao ⋅ Zuwei Long ⋅ Heting Gao ⋅ Ke Li ⋅ Long MA ⋅ Xiawu Zheng ⋅ Rongrong Ji ⋅ Xing Sun ⋅ Caifeng Shan ⋅ Ran He
2025 Spotlight Poster

Abstract

Video

Chat is not available.