159 Results
Poster
|
Thu 9:00 |
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts Hangbo Bao · Wenhui Wang · Li Dong · Qiang Liu · Owais Khan Mohammed · Kriti Aggarwal · Subhojit Som · Songhao Piao · Furu Wei |
|
Poster
|
Contrastive Language-Image Pre-Training with Knowledge Graphs Xuran Pan · Tianzhu Ye · Dongchen Han · Shiji Song · Gao Huang |
||
Poster
|
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining Yuting Gao · Jinfeng Liu · Zihan Xu · Jun Zhang · Ke Li · Rongrong Ji · Chunhua Shen |
||
Poster
|
Thu 9:00 |
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark Jiaxi Gu · Xiaojun Meng · Guansong Lu · Lu Hou · Niu Minzhe · Xiaodan Liang · Lewei Yao · Runhui Huang · Wei Zhang · Xin Jiang · Chunjing Xu · Hang Xu |
|
Poster
|
Thu 14:00 |
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone Zi-Yi Dou · Aishwarya Kamath · Zhe Gan · Pengchuan Zhang · Jianfeng Wang · Linjie Li · Zicheng Liu · Ce Liu · Yann LeCun · Nanyun Peng · Jianfeng Gao · Lijuan Wang |
|
Poster
|
Tue 9:00 |
UniCLIP: Unified Framework for Contrastive Language-Image Pre-training Janghyeon Lee · Jongsuk Kim · Hyounguk Shon · Bumsoo Kim · Seung Hwan Kim · Honglak Lee · Junmo Kim |
|
Poster
|
TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training Yulong Liu · Guibo Zhu · Bin Zhu · Qi Song · Guojing Ge · Haoran Chen · GuanHui Qiao · Ru Peng · Lingxiang Wu · Jinqiao Wang |
||
Poster
|
Tue 14:00 |
Fine-Grained Semantically Aligned Vision-Language Pre-Training Juncheng Li · Xin He · Longhui Wei · Long Qian · Linchao Zhu · Lingxi Xie · Yueting Zhuang · Qi Tian · Siliang Tang |
|
Poster
|
Thu 9:00 |
TVLT: Textless Vision-Language Transformer Zineng Tang · Jaemin Cho · Yixin Nie · Mohit Bansal |
|
Poster
|
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks Junke Wang · Dongdong Chen · Zuxuan Wu · Chong Luo · Luowei Zhou · Yucheng Zhao · Yujia Xie · Ce Liu · Yu-Gang Jiang · Lu Yuan |
||
Workshop
|
Fri 9:15 |
Carsten Binnig - Pre-trained Models for Learned DBMS Components Carsten Binnig |
|
Workshop
|
Fri 6:00 |
Fine-grained Interactive Vision Language Pre-training Lu Hou