Building and better understanding vision-language models: insights and future directions

Hugo Laurençon ⋅ Andrés Marafioti ⋅ Victor Sanh ⋅ Leo Tronchon
Keywords: VLM, multimodal

Abstract
