Search All 2024 Events
 

1378 Results

Page 3 of 115
Poster
Wed 16:30 SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge
Chuanhao Li · Zhen Li · Chenchen Jing · Shuo Liu · Wenqi Shao · Yuwei Wu · Ping Luo · Yu Qiao · Kaipeng Zhang
Workshop
Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models
Hyunsik Chae · Seungwoo Yoon · Chloe Yewon Chun · Gyehun Go · Yongin Cho · Gyeongmin Lee · Ernest Ryu
Poster
Wed 16:30 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu · Muyan Zhong · Sen Xing · Zeqiang Lai · Zhaoyang Liu · Zhe Chen · Wenhai Wang · Xizhou Zhu · Lewei Lu · Tong Lu · Ping Luo · Yu Qiao · Jifeng Dai
Workshop
Sat 13:00 Navigating Neural Fields with Vision-Language Models
Neale Ratzlaff · Phillip Howard · Vasudev Lal
Poster
Thu 16:30 Matryoshka Query Transformer for Large Vision-Language Models
Wenbo Hu · Zi-Yi Dou · Liunian Li · Amita Kamath · Nanyun Peng · Kai-Wei Chang
Poster
Wed 11:00 No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models
Angéline Pouget · Lucas Beyer · Emanuele Bugliarello · Xiao Wang · Andreas Steiner · Xiaohua Zhai · Ibrahim Alabdulmohsin
Workshop
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia · Siwei Han · Shi Qiu · Yiyang Zhou · Zhaoyang Wang · Wenhao Zheng · Zhaorun Chen · Chenhang Cui · Mingyu Ding · Linjie Li · Lijuan Wang · Huaxiu Yao
Poster
Wed 16:30 VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Divyansh Srivastava · Ge Yan · Lily Weng
Poster
Wed 16:30 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Simon Zhai · Hao Bai · Zipeng Lin · Jiayi Pan · Peter Tong · Yifei Zhou · Alane Suhr · Saining Xie · Yann LeCun · Yi Ma · Sergey Levine
Poster
Thu 16:30 IPO: Interpretable Prompt Optimization for Vision-Language Models
Yingjun Du · Wenfang Sun · Cees Snoek
Workshop
Monkey See, Model Knew: Large Language Models accurately Predict Human AND Macaque Visual Brain Activity
Colin Conwell · Emalie McMahon · Akshay Jagadeesh · Kasper Vinken · Saloni Sharma · Jacob Prince · George Alvarez · Talia Konkle · Leyla Isik · Margaret Livingstone
Poster
Thu 16:30 Bridge the Modality and Capability Gaps in Vision-Language Model Selection
Chao Yi · Yuhang He · De-Chuan Zhan · Han-Jia Ye