260 Results
Poster
|
Thu 9:00 |
TVLT: Textless Vision-Language Transformer Zineng Tang · Jaemin Cho · Yixin Nie · Mohit Bansal |
|
Poster
|
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks Junke Wang · Dongdong Chen · Zuxuan Wu · Chong Luo · Luowei Zhou · Yucheng Zhao · Yujia Xie · Ce Liu · Yu-Gang Jiang · Lu Yuan |
||
Workshop
|
Mitigating Lies in Vision-Language Models Junbo Li · Xianhang Li · Cihang Xie |
||
Workshop
|
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains Pierre Chambon · Christian Bluethgen · Curtis Langlotz · Akshay Chaudhari |
||
Poster
|
Thu 9:00 |
ReCo: Retrieve and Co-segment for Zero-shot Transfer Gyungin Shin · Weidi Xie · Samuel Albanie |
|
Workshop
|
Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models Ted Xiao · Harris Chan · Pierre Sermanet · Ayzaan Wahid · Anthony Brohan · Karol Hausman · Sergey Levine · Jonathan Tompson |
||
Poster
|
Tue 9:00 |
WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models Yonatan Bitton · Nitzan Bitton Guetta · Ron Yosef · Yuval Elovici · Mohit Bansal · Gabriel Stanovsky · Roy Schwartz |
|
Poster
|
Thu 14:00 |
Patching open-vocabulary models by interpolating weights Gabriel Ilharco · Mitchell Wortsman · Samir Yitzhak Gadre · Shuran Song · Hannaneh Hajishirzi · Simon Kornblith · Ali Farhadi · Ludwig Schmidt |
|
Poster
|
Thu 9:00 |
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models Manli Shu · Weili Nie · De-An Huang · Zhiding Yu · Tom Goldstein · Anima Anandkumar · Chaowei Xiao |
|
Poster
|
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining Yuting Gao · Jinfeng Liu · Zihan Xu · Jun Zhang · Ke Li · Rongrong Ji · Chunhua Shen |
||
Poster
|
Wed 14:00 |
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning Yujia Xie · Luowei Zhou · Xiyang Dai · Lu Yuan · Nguyen Bach · Ce Liu · Michael Zeng |