Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

884 Results

<<   <   Page 1 of 74   >   >>
Poster
Fri 16:30 E3: Exploring Embodied Emotion Through A Large-Scale Egocentric Video Dataset
wang lin · Yueying Feng · WenKang Han · Tao Jin · Zhou Zhao · Fei Wu · Chang Yao · Jingyuan Chen
Workshop
Sat 15:45 Auto-Evaluation with Few Labels through Post-hoc Regression
Benjamin Eyre · David Madras
Poster
Wed 16:30 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Yang Yue · Yulin Wang · Bingyi Kang · Yizeng Han · Shenzhi Wang · Shiji Song · Jiashi Feng · Gao Huang
Workshop
Sat 10:30 Efficient Generative Multimodal Integration (EGMI): Enabling Audio Generation from Text-Image Pairs through Alignment with Large Language Models
Taemin Kim · Wooyeol Baek · Heeseok Oh
Poster
Thu 16:30 SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Jonathan Roberts · Kai Han · Neil Houlsby · Samuel Albanie
Poster
Toward a Stable, Fair, and Comprehensive Evaluation of Object Hallucination in Large Vision-Language Models
Hongliang Wei · Xingtao Wang · Xianqi Zhang · Xiaopeng Fan · Debin Zhao
Expo Demonstration
Tue 15:00 Large Multimodal Model running on a mobile device
Ron Tindall
Poster
Wed 16:30 A Concept-Based Explainability Framework for Large Multimodal Models
Jayneel Parekh · Pegah KHAYATAN · Mustafa Shukor · Alasdair Newson · Matthieu Cord
Workshop
A Concept-Based Explainability Framework for Large Multimodal Models
Jayneel Parekh · Pegah KHAYATAN · Mustafa Shukor · Alasdair Newson · Matthieu Cord
Poster
Thu 16:30 Q-VLM: Post-training Quantization for Large Vision-Language Models
Changyuan Wang · Ziwei Wang · Xiuwei Xu · Yansong Tang · Jie Zhou · Jiwen Lu
Poster
Thu 11:00 Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Nitzan Bitton Guetta · Aviv Slobodkin · Aviya Maimon · Eliya Habba · Royi Rassin · Yonatan Bitton · Idan Szpektor · Amir Globerson · Yuval Elovici
Affinity Event
ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models
Ahmed Masry · Enamul Hoque