This paper explores a hierarchical prompting mechanism for the hierarchical image classification (HIC) task. Unlike prior HIC methods, our hierarchical prompting is the first to explicitly inject ancestor-class information as a tokenized hint that benefits descendant-class discrimination. We believe this closely imitates human visual recognition, i.e., humans may use the ancestor class as a prompt to focus on the subtle differences among descendant classes. We build this prompting mechanism into a Transformer with Hierarchical Prompting (TransHP). TransHP consists of three steps: 1) learning a set of prompt tokens to represent the coarse (ancestor) classes, 2) predicting the coarse class of the input image on the fly at an intermediate block, and 3) injecting the prompt token of the predicted coarse class into the intermediate feature. Although the parameters of TransHP remain the same for all input images, the injected coarse-class prompt conditions (modifies) the subsequent feature extraction and encourages a dynamic focus on the relatively subtle differences among the descendant classes. Extensive experiments show that TransHP improves image classification in accuracy (e.g., improving ViT-B/16 by +2.83% ImageNet classification accuracy), training data efficiency (e.g., a +12.69% improvement with 10% of the ImageNet training data), and model explainability. Moreover, TransHP also performs favorably against prior HIC methods, showing that TransHP exploits the hierarchical information well.
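The three steps named in the abstract can be illustrated with a toy sketch in plain Python. This is not the authors' implementation: the function names (`predict_coarse`, `inject_prompt`, `prompting_block`), the mean-pooling of tokens, the dot-product scoring, and the hard argmax selection are all assumptions made for illustration, and the prompt tokens are fixed by hand here rather than learned.

```python
def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_pool(tokens):
    """Average a list of token vectors into a single vector."""
    n = len(tokens)
    return [sum(column) / n for column in zip(*tokens)]


def predict_coarse(tokens, prompt_pool):
    """Step 2 (assumed form): score the pooled intermediate feature
    against every coarse-class prompt and return the best index."""
    pooled = mean_pool(tokens)
    scores = [dot(pooled, prompt) for prompt in prompt_pool]
    return max(range(len(scores)), key=scores.__getitem__)


def inject_prompt(tokens, prompt_pool, coarse_id):
    """Step 3 (assumed form): append the selected prompt token to the
    token sequence so that later blocks can attend to it."""
    return tokens + [prompt_pool[coarse_id]]


def prompting_block(tokens, prompt_pool):
    """Run the predict-then-inject steps at one intermediate block."""
    coarse_id = predict_coarse(tokens, prompt_pool)
    return inject_prompt(tokens, prompt_pool, coarse_id), coarse_id


# Step 1 stand-in: one prompt token per coarse class. In a real model
# these would be learned parameters; here they are hypothetical constants.
prompt_pool = [[1.0, 0.0], [0.0, 1.0]]

# Hypothetical intermediate features for one image (two patch tokens).
tokens = [[0.2, 0.9], [0.1, 0.7]]

new_tokens, coarse_id = prompting_block(tokens, prompt_pool)
```

In this sketch the prompt pool is shared by every input, yet the token that is appended depends on the predicted coarse class, which mirrors the abstract's point that fixed parameters can still condition the later feature extraction on the input.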
Author Information
Wenhao Wang (University of Technology Sydney)
Yifan Sun (Megvii Technology Inc.)
Wei Li (Zhejiang University)
Yi Yang (Zhejiang University)
More from the Same Authors
-
2023 Poster: Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction »
Zechuan Zhang · Li Sun · Zongxin Yang · Ling Chen · Yi Yang -
2023 Poster: Hyperbolic Space with Hierarchical Margin Boosts Fine-Grained Learning from Coarse Labels »
Shu-Lin Xu · Yifan Sun · Faen Zhang · Anqi Xu · Xiu-Shen Wei · Yi Yang -
2023 Poster: Neural-Logic Human-Object Interaction Detection »
Liulei Li · Jianan Wei · Wenguan Wang · Yi Yang -
2023 Poster: DAC-DETR: Divide the Attention Layers and Conquer »
Zhengdong Hu · Yifan Sun · Jingdong Wang · Yi Yang -
2022 Spotlight: Lightning Talks 6B-3 »
Lingfeng Yang · Yao Lai · Zizheng Pan · Zhenyu Wang · Weicong Liang · Chuanyang Zheng · Jian-Wei Zhang · Peng Jin · Jing Liu · Xiuying Wei · Yao Mu · Xiang Li · YUHUI YUAN · Zizheng Pan · Yifan Sun · Yunchen Zhang · Jianfei Cai · Hao Luo · zheyang li · Jinfa Huang · Haoyu He · Yi Yang · Ping Luo · Fenglin Liu · Henghui Ding · Borui Zhao · Xiangguo Zhang · Kai Zhang · Pichao WANG · Bohan Zhuang · Wei Chen · Ruihao Gong · Zhi Yang · Xian Wu · Feng Ding · Jianfei Cai · Xiao Luo · Renjie Song · Weihong Lin · Jian Yang · Wenming Tan · Bohan Zhuang · Shanghang Zhang · Shen Ge · Fan Wang · Qi Zhang · Guoli Song · Jun Xiao · Hao Li · Ding Jia · David Clifton · Ye Ren · Fengwei Yu · Zheng Zhang · Jie Chen · Shiliang Pu · Xianglong Liu · Chao Zhang · Han Hu -
2022 Spotlight: Decoupling Features in Hierarchical Propagation for Video Object Segmentation »
Zongxin Yang · Yi Yang -
2022 Spotlight: Feature-Proxy Transformer for Few-Shot Segmentation »
Jian-Wei Zhang · Yifan Sun · Yi Yang · Wei Chen -
2022 Spotlight: Lightning Talks 6A-1 »
Ziyi Wang · Nian Liu · Yaming Yang · Qilong Wang · Yuanxin Liu · Zongxin Yang · Yizhao Gao · Yanchen Deng · Dongze Lian · Nanyi Fei · Ziyu Guan · Xiao Wang · Shufeng Kong · Xumin Yu · Daquan Zhou · Yi Yang · Fandong Meng · Mingze Gao · Caihua Liu · Yongming Rao · Zheng Lin · Haoyu Lu · Zhe Wang · Jiashi Feng · Zhaolin Zhang · Deyu Bo · Xinchao Wang · Chuan Shi · Jiangnan Li · Jiangtao Xie · Jie Zhou · Zhiwu Lu · Wei Zhao · Bo An · Jiwen Lu · Peihua Li · Jian Pei · Hao Jiang · Cai Xu · Peng Fu · Qinghua Hu · Yijie Li · Weigang Lu · Yanan Cao · Jianbin Huang · Weiping Wang · Zhao Cao · Jie Zhou -
2022 Spotlight: Lightning Talks 1A-4 »
Siwei Wang · Jing Liu · Nianqiao Ju · Shiqian Li · Eloïse Berthier · Muhammad Faaiz Taufiq · Arsene Fansi Tchango · Chen Liang · Chulin Xie · Jordan Awan · Jean-Francois Ton · Ziad Kobeissi · Wenguan Wang · Xinwang Liu · Kewen Wu · Rishab Goel · Jiaxu Miao · Suyuan Liu · Julien Martel · Ruobin Gong · Francis Bach · Chi Zhang · Rob Cornish · Sanmi Koyejo · Zhi Wen · Yee Whye Teh · Yi Yang · Jiaqi Jin · Bo Li · Yixin Zhu · Vinayak Rao · Wenxuan Tu · Gaetan Marceau Caron · Arnaud Doucet · Xinzhong Zhu · Joumana Ghosn · En Zhu -
2022 Spotlight: GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models »
Chen Liang · Wenguan Wang · Jiaxu Miao · Yi Yang -
2022 Poster: Feature-Proxy Transformer for Few-Shot Segmentation »
Jian-Wei Zhang · Yifan Sun · Yi Yang · Wei Chen -
2022 Poster: GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models »
Chen Liang · Wenguan Wang · Jiaxu Miao · Yi Yang -
2022 Poster: Decoupling Features in Hierarchical Propagation for Video Object Segmentation »
Zongxin Yang · Yi Yang -
2021 Poster: Spatial Ensemble: a Novel Model Smoothing Mechanism for Student-Teacher Framework »
Tengteng Huang · Yifan Sun · Xun Wang · Haotian Yao · Chi Zhang