firstbacksecondback
1333 Results
Poster
|
Thu 16:30 |
IPO: Interpretable Prompt Optimization for Vision-Language Models Yingjun Du · Wenfang Sun · Cees Snoek |
|
Workshop
|
Sat 15:45 |
Auto-Evaluation with Few Labels through Post-hoc Regression Benjamin Eyre · David Madras |
|
Poster
|
Thu 16:30 |
Bridge the Modality and Capability Gaps in Vision-Language Model Selection Chao Yi · Yuhang He · De-Chuan Zhan · Han-Jia Ye |
|
Poster
|
Toward a Stable, Fair, and Comprehensive Evaluation of Object Hallucination in Large Vision-Language Models Hongliang Wei · Xingtao Wang · Xianqi Zhang · Xiaopeng Fan · Debin Zhao |
||
Poster
|
Thu 16:30 |
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning Yanbin Wei · Shuai Fu · Weisen Jiang · Zejian Zhang · Zhixiong Zeng · Qi Wu · James Kwok · Yu Zhang |
|
Workshop
|
Understanding Graphical Perception in Data Visualization through Zero-shot Prompting of Vision-Language Models Grace Guo · Jenna Kang · Raj Sanjay Shah · Hanspeter Pfister · Sashank Varma |
||
Poster
|
Thu 16:30 |
Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models Arshia Hemmat · Adam Davies · Tom Lamb · Jianhao Yuan · Philip Torr · Ashkan Khakzar · Francesco Pinto |
|
Workshop
|
Quo Vadis, Video Understanding with Vision-Language Foundation Models? Mahmoud ALI · Di Yang · Arkaprava Sinha · Dominick Reilly · Srijan Das · Gianpiero Francesca · francois bremond |
||
Poster
|
Wed 11:00 |
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models An-Chieh Cheng · Hongxu Yin · Yang Fu · Qiushan Guo · Ruihan Yang · Jan Kautz · Xiaolong Wang · Sifei Liu |
|
Workshop
|
Assisted Few-Shot Learning for Vision-Language Models in Agricultural Stress Phenotype Identification Muhammad Arbab Arshad · Talukder "Zaki" Jubery · Asheesh Singh · ARTI SINGH · Chinmay Hegde · Baskar Ganapathysubramanian · Aditya Balu · Adarsh Krishnamurthy · Soumik Sarkar |
||
Workshop
|
Can Vision-Language Models Replace Human Annotators: A Case Study with CelebA Dataset Haoming Lu · Feifei Zhong |
||
Poster
|
Wed 16:30 |
Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models Mengyuan Chen · Junyu Gao · Changsheng Xu |