884 Results
Poster | Fri 16:30 | : Exploring Embodied Emotion Through A Large-Scale Egocentric Video Dataset wang lin · Yueying Feng · WenKang Han · Tao Jin · Zhou Zhao · Fei Wu · Chang Yao · Jingyuan Chen
Workshop | Sat 15:45 | Auto-Evaluation with Few Labels through Post-hoc Regression Benjamin Eyre · David Madras
Poster | Wed 16:30 | DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution Yang Yue · Yulin Wang · Bingyi Kang · Yizeng Han · Shenzhi Wang · Shiji Song · Jiashi Feng · Gao Huang
Workshop | Sat 10:30 | Efficient Generative Multimodal Integration (EGMI): Enabling Audio Generation from Text-Image Pairs through Alignment with Large Language Models Taemin Kim · Wooyeol Baek · Heeseok Oh
Poster | Thu 16:30 | SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation Jonathan Roberts · Kai Han · Neil Houlsby · Samuel Albanie
Poster | | Toward a Stable, Fair, and Comprehensive Evaluation of Object Hallucination in Large Vision-Language Models Hongliang Wei · Xingtao Wang · Xianqi Zhang · Xiaopeng Fan · Debin Zhao
Expo Demonstration | Tue 15:00 | Large Multimodal Model running on a mobile device Ron Tindall
Poster | Wed 16:30 | A Concept-Based Explainability Framework for Large Multimodal Models Jayneel Parekh · Pegah KHAYATAN · Mustafa Shukor · Alasdair Newson · Matthieu Cord
Workshop | | A Concept-Based Explainability Framework for Large Multimodal Models Jayneel Parekh · Pegah KHAYATAN · Mustafa Shukor · Alasdair Newson · Matthieu Cord
Poster | Thu 16:30 | Q-VLM: Post-training Quantization for Large Vision-Language Models Changyuan Wang · Ziwei Wang · Xiuwei Xu · Yansong Tang · Jie Zhou · Jiwen Lu
Poster | Thu 11:00 | Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models Nitzan Bitton Guetta · Aviv Slobodkin · Aviya Maimon · Eliya Habba · Royi Rassin · Yonatan Bitton · Idan Szpektor · Amir Globerson · Yuval Elovici
Affinity Event | | ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models Ahmed Masry · Enamul Hoque