1378 Results
Poster | Wed 16:30 | SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge | Chuanhao Li · Zhen Li · Chenchen Jing · Shuo Liu · Wenqi Shao · Yuwei Wu · Ping Luo · Yu Qiao · Kaipeng Zhang
Workshop | | Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models | Hyunsik Chae · Seungwoo Yoon · Chloe Yewon Chun · Gyehun Go · Yongin Cho · Gyeongmin Lee · Ernest Ryu
Poster | Wed 16:30 | VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jiannan Wu · Muyan Zhong · Sen Xing · Zeqiang Lai · Zhaoyang Liu · Zhe Chen · Wenhai Wang · Xizhou Zhu · Lewei Lu · Tong Lu · Ping Luo · Yu Qiao · Jifeng Dai
Workshop | Sat 13:00 | Navigating Neural Fields with Vision-Language Models | Neale Ratzlaff · Phillip Howard · VASUDEV LAL
Poster | Thu 16:30 | Matryoshka Query Transformer for Large Vision-Language Models | Wenbo Hu · Zi-Yi Dou · Liunian Li · Amita Kamath · Nanyun Peng · Kai-Wei Chang
Poster | Wed 11:00 | No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models | Angéline Pouget · Lucas Beyer · Emanuele Bugliarello · Xiao Wang · Andreas Steiner · Xiaohua Zhai · Ibrahim Alabdulmohsin
Workshop | | MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models | Peng Xia · Siwei Han · Shi Qiu · Yiyang Zhou · Zhaoyang Wang · Wenhao Zheng · Zhaorun Chen · Chenhang Cui · Mingyu Ding · Linjie Li · Lijuan Wang · Huaxiu Yao
Poster | Wed 16:30 | VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance | Divyansh Srivastava · Ge Yan · Lily Weng
Poster | Wed 16:30 | Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Simon Zhai · Hao Bai · Zipeng Lin · Jiayi Pan · Peter Tong · Yifei Zhou · Alane Suhr · Saining Xie · Yann LeCun · Yi Ma · Sergey Levine
Poster | Thu 16:30 | IPO: Interpretable Prompt Optimization for Vision-Language Models | Yingjun Du · Wenfang Sun · Cees Snoek
Workshop | | Monkey See, Model Knew: Large Language Models accurately Predict Human AND Macaque Visual Brain Activity | Colin Conwell · Emalie McMahon · Akshay Jagadeesh · Kasper Vinken · Saloni Sharma · Jacob Prince · George Alvarez · Talia Konkle · Leyla Isik · Margaret Livingstone
Poster | Thu 16:30 | Bridge the Modality and Capability Gaps in Vision-Language Model Selection | Chao Yi · Yuhang He · De-Chuan Zhan · Han-Jia Ye