Timezone: »
Finding correspondences between images is a fundamental problem in computer vision. In this paper, we show that correspondence emerges in image diffusion models without any explicit supervision. We propose a simple strategy to extract this implicit knowledge out of diffusion networks as image features, namely DIffusion FeaTures (DIFT), and use them to establish correspondences between real images. Without any additional fine-tuning or supervision on the task-specific data or annotations, DIFT is able to outperform both weakly-supervised methods and competitive off-the-shelf features in identifying semantic, geometric, and temporal correspondences. Particularly for semantic correspondence, DIFT from Stable Diffusion is able to outperform DINO and OpenCLIP by 19 and 14 accuracy points respectively on the challenging SPair-71k benchmark. It even outperforms the state-of-the-art supervised methods on 9 out of 18 categories while remaining on par for the overall performance. Project page: https://diffusionfeatures.github.io.
Author Information
Luming Tang (Cornell University)
Menglin Jia (Cornell University)
Qianqian Wang (Cornell university)
Cheng Perng Phoo (Cornell University)
Bharath Hariharan (Cornell University)
More from the Same Authors
-
2023 Poster: Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery »
Katie Luo · Zhenzhen Liu · Xiangyu Chen · Yurong You · Sagie Benaim · Cheng Perng Phoo · Mark Campbell · Wen Sun · Bharath Hariharan · Kilian Weinberger -
2023 Poster: Dynamo-Depth: Fixing Unsupervised Depth Estimation for Dynamical Scenes »
Yihong Sun · Bharath Hariharan -
2022 Poster: Unsupervised Adaptation from Repeated Traversals for Autonomous Driving »
Yurong You · Cheng Perng Phoo · Katie Luo · Travis Zhang · Wei-Lun (Harry) Chao · Bharath Hariharan · Mark Campbell · Kilian Weinberger -
2022 Poster: Change Event Dataset for Discovery from Spatio-temporal Remote Sensing Imagery »
Utkarsh Mall · Bharath Hariharan · Kavita Bala -
2022 Poster: Polynomial Neural Fields for Subband Decomposition and Manipulation »
Guandao Yang · Sagie Benaim · Varun Jampani · Kyle Genova · Jonathan Barron · Thomas Funkhouser · Bharath Hariharan · Serge Belongie