Poster
|
Wed 9:00 |
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations Madeline Chantry · Shruti Vyas · Hamid Palangi · Yogesh Rawat · Vibhav Vineet |
|
Poster
|
Thu 14:00 |
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks Tejas Srinivasan · Ting-Yun Chang · Leticia Pinto Alva · Georgios Chochlakis · Mohammad Rostami · Jesse Thomason |
|
Poster
|
Wed 9:00 |
MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification Peirong Zhang · Jiajia Jiang · Yuliang Liu · Lianwen Jin |
|
Poster
|
Tue 9:00 |
ActionSense: A Multimodal Dataset and Recording Framework for Human Activities Using Wearable Sensors in a Kitchen Environment Joseph DelPreto · Chao Liu · Yiyue Luo · Michael Foshey · Yunzhu Li · Antonio Torralba · Wojciech Matusik · Daniela Rus |
|
Poster
|
Tue 9:00 |
CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets Md Mofijul Islam · Reza Mirzaiee · Alexi Gladstone · Haley Green · Tariq Iqbal |
|
Poster
|
Tue 14:00 |
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts Basil Mustafa · Carlos Riquelme · Joan Puigcerver · Rodolphe Jenatton · Neil Houlsby |
|
Poster
|
Wed 14:00 |
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP Andreas Fürst · Elisabeth Rumetshofer · Johannes Lehner · Viet T. Tran · Fei Tang · Hubert Ramsauer · David Kreil · Michael Kopp · Günter Klambauer · Angela Bitto · Sepp Hochreiter |
|
Poster
|
Divert More Attention to Vision-Language Tracking Mingzhe Guo · Zhipeng Zhang · Heng Fan · Liping Jing |
|
Poster
|
Tue 9:00 |
Towards Versatile Embodied Navigation Hanqing Wang · Wei Liang · Luc V Gool · Wenguan Wang |
|
Poster
|
Thu 9:00 |
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models Jin-Hwa Kim · Yunji Kim · Jiyoung Lee · Kang Min Yoo · Sang-Woo Lee |
|
Poster
|
Wed 14:00 |
Cross-modal Learning for Image-Guided Point Cloud Shape Completion Emanuele Aiello · Diego Valsesia · Enrico Magli |
|
Poster
|
Wed 9:00 |
VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation Kaizhi Zheng · Xiaotong Chen · Odest Chadwicke Jenkins · Xin Wang |