Timezone: »
Amodal perception requires inferring the full shape of an object that is partially occluded. This task is particularly challenging on two levels: (1) it requires more information than what is contained in the instant retina or imaging sensor, (2) it is difficult to obtain enough well-annotated amodal labels for supervision. To this end, this paper develops a new framework of Self-supervised amodal Video object segmentation (SaVos). Our method efficiently leverages the visual information of video temporal sequences to infer the amodal mask of objects. The key intuition is that the occluded part of an object can be explained away if that part is visible in other frames, possibly deformed as long as the deformation can be reasonably learned. Accordingly, we derive a novel self-supervised learning paradigm that efficiently utilizes the visible object parts as the supervision to guide the training on videos. In addition to learning type prior to complete masks for known types, SaVos also learns the spatiotemporal prior, which is also useful for the amodal task and could generalize to unseen types. The proposed framework achieves the state-of-the-art performance on the synthetic amodal segmentation benchmark FISHBOWL and the real world benchmark KINS-Video-Car. Further, it lends itself well to being transferred to novel distributions using test-time adaptation, outperforming existing models even after the transfer to a new distribution.
Author Information
Jian Yao (Fudan University)
Yuxin Hong (Fudan University)
Chiyu Wang (University of California, Berkeley)
Tianjun Xiao (Amazon AI)
Tong He (Amazon Web Services)
Francesco Locatello (Amazon)
David P Wipf (AWS)
Yanwei Fu (Fudan University, Shanghai;)
Zheng Zhang (Shanghai New York Univeristy)
More from the Same Authors
-
2021 : A Closer Look at Distribution Shifts and Out-of-Distribution Generalization on Graphs »
Mucong Ding · Kezhi Kong · Jiuhai Chen · John Kirchenbauer · Micah Goldblum · David P Wipf · Furong Huang · Tom Goldstein -
2022 Poster: Learning Enhanced Representation for Tabular Data via Neighborhood Propagation »
Kounianhua Du · Weinan Zhang · Ruiwen Zhou · Yangkun Wang · Xilong Zhao · Jiarui Jin · Quan Gan · Zheng Zhang · David P Wipf -
2022 : Scalable Causal Discovery with Score Matching »
Francesco Montagna · Nicoletta Noceti · Lorenzo Rosasco · Kun Zhang · Francesco Locatello -
2022 Spotlight: Lightning Talks 5B-3 »
Yanze Wu · Jie Xiao · Nianzu Yang · Jieyi Bi · Jian Yao · Yiting Chen · Qizhou Wang · Yangru Huang · Yongqiang Chen · Peixi Peng · Yuxin Hong · Xintao Wang · Feng Liu · Yining Ma · Qibing Ren · Xueyang Fu · Yonggang Zhang · Kaipeng Zeng · Jiahai Wang · GEN LI · Yonggang Zhang · Qitian Wu · Yifan Zhao · Chiyu Wang · Junchi Yan · Feng Wu · Yatao Bian · Xiaosong Jia · Ying Shan · Zhiguang Cao · Zheng-Jun Zha · Guangyao Chen · Tianjun Xiao · Han Yang · Jing Zhang · Jinbiao Chen · MA Kaili · Yonghong Tian · Junchi Yan · Chen Gong · Tong He · Binghui Xie · Yuan Sun · Francesco Locatello · Tongliang Liu · Yeow Meng Chee · David P Wipf · Tongliang Liu · Bo Han · Bo Han · Yanwei Fu · James Cheng · Zheng Zhang -
2022 Spotlight: Self-supervised Amodal Video Object Segmentation »
Jian Yao · Yuxin Hong · Chiyu Wang · Tianjun Xiao · Tong He · Francesco Locatello · David P Wipf · Yanwei Fu · Zheng Zhang -
2022 Spotlight: NodeFormer: A Scalable Graph Structure Learning Transformer for Node Classification »
Qitian Wu · Wentao Zhao · Zenan Li · David P Wipf · Junchi Yan -
2022 Spotlight: Lightning Talks 1B-1 »
Qitian Wu · Runlin Lei · Rongqin Chen · Luca Pinchetti · Yangze Zhou · Abhinav Kumar · Hans Hao-Hsun Hsu · Wentao Zhao · Chenhao Tan · Zhen Wang · Shenghui Zhang · Yuesong Shen · Tommaso Salvatori · Gitta Kutyniok · Zenan Li · Amit Sharma · Leong Hou U · Yordan Yordanov · Christian Tomani · Bruno Ribeiro · Yaliang Li · David P Wipf · Daniel Cremers · Bolin Ding · Beren Millidge · Ye Li · Yuhang Song · Junchi Yan · Zhewei Wei · Thomas Lukasiewicz -
2022 Poster: Are Two Heads the Same as One? Identifying Disparate Treatment in Fair Neural Networks »
Michael Lohaus · Matthäus Kleindessner · Krishnaram Kenthapadi · Francesco Locatello · Chris Russell -
2022 Poster: NodeFormer: A Scalable Graph Structure Learning Transformer for Node Classification »
Qitian Wu · Wentao Zhao · Zenan Li · David P Wipf · Junchi Yan -
2022 Poster: Neural Attentive Circuits »
Martin Weiss · Nasim Rahaman · Francesco Locatello · Chris Pal · Yoshua Bengio · Bernhard Schölkopf · Erran Li Li · Nicolas Ballas -
2022 Poster: Assaying Out-Of-Distribution Generalization in Transfer Learning »
Florian Wenzel · Andrea Dittadi · Peter Gehler · Carl-Johann Simon-Gabriel · Max Horn · Dominik Zietlow · David Kernert · Chris Russell · Thomas Brox · Bernt Schiele · Bernhard Schölkopf · Francesco Locatello -
2022 Poster: Transformers from an Optimization Perspective »
Yongyi Yang · zengfeng Huang · David P Wipf -
2022 Poster: Descent Steps of a Relation-Aware Energy Produce Heterogeneous Graph Neural Networks »
Hongjoon Ahn · Yongyi Yang · Quan Gan · Taesup Moon · David P Wipf -
2022 Poster: Learning Manifold Dimensions with Conditional Variational Autoencoders »
Yijia Zheng · Tong He · Yixuan Qiu · David P Wipf -
2021 : A Closer Look at Distribution Shifts and Out-of-Distribution Generalization on Graphs »
Mucong Ding · Kezhi Kong · Jiuhai Chen · John Kirchenbauer · Micah Goldblum · David P Wipf · Furong Huang · Tom Goldstein -
2021 Poster: GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction »
Longyuan Li · Jian Yao · Li Wenliang · Tong He · Tianjun Xiao · Junchi Yan · David Wipf · Zheng Zhang -
2021 Poster: Progressive Coordinate Transforms for Monocular 3D Object Detection »
Li Wang · Li Zhang · Yi Zhu · Zhi Zhang · Tong He · Mu Li · Xiangyang Xue -
2021 Poster: The Image Local Autoregressive Transformer »
Chenjie Cao · Yuxin Hong · Xiang Li · Chengrong Wang · Chengming Xu · Yanwei Fu · Xiangyang Xue -
2020 Poster: Further Analysis of Outlier Detection with Deep Generative Models »
Ziyu Wang · Bin Dai · David P Wipf · Jun Zhu -
2019 : Invited Presentation: Deep Graph Library »
Zheng Zhang -
2019 Poster: Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition »
Satoshi Tsutsui · Yanwei Fu · David Crandall -
2018 Poster: Loss Functions for Multiset Prediction »
Sean Welleck · Zixin Yao · Yu Gai · Jialin Mao · Zheng Zhang · Kyunghyun Cho -
2018 Poster: Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning »
yunlong yu · Zhong Ji · Yanwei Fu · Jichang Guo · Yanwei Pang · Zhongfei (Mark) Zhang -
2017 Poster: Saliency-based Sequential Image Attention with Multiset Prediction »
Sean Welleck · Jialin Mao · Kyunghyun Cho · Zheng Zhang -
2012 Poster: Dual-Space Analysis of the Sparse Linear Model »
David P Wipf -
2011 Poster: Sparse Estimation with Structured Dictionaries »
David P Wipf -
2011 Spotlight: Sparse Estimation with Structured Dictionaries »
David P Wipf -
2009 Poster: Sparse Estimation Using General Likelihoods and Non-Factorial Priors »
David P Wipf · Sri Nagarajan -
2008 Poster: Estimating the Location and Orientation of Complex, Correlated Neural Activity using MEG »
David P Wipf · Julia Owen · Hagai Attias · Kensuke Sekihara · Sri Nagarajan -
2008 Spotlight: Estimating the Location and Orientation of Complex, Correlated Neural Activity using MEG »
David P Wipf · Julia Owen · Hagai Attias · Kensuke Sekihara · Sri Nagarajan -
2007 Poster: A New View of Automatic Relevance Determination »
David P Wipf · Srikantan Nagarajan -
2006 Poster: Analysis of Empirical Bayesian Methods for Neuroelectromagnetic Source Localization »
David P Wipf · Rey R Ramirez · Jason A Palmer · Scott Makeig · Bhaskar Rao -
2006 Spotlight: Analysis of Empirical Bayesian Methods for Neuroelectromagnetic Source Localization »
David P Wipf · Rey R Ramirez · Jason A Palmer · Scott Makeig · Bhaskar Rao