Timezone: »
In visual recognition, the images are frequently modeled as sets of local features (bags). We show that bag of words, a common method to handle such cases, can be viewed as a special match kernel, which counts 1 if two local features fall into the same regions partitioned by visual words and 0 otherwise. Despite its simplicity, this quantization is too coarse. It is, therefore, appealing to design match kernels that more accurately measure the similarity between local features. However, it is impractical to use such kernels on large datasets due to their significant computational cost. To address this problem, we propose an efficient match kernel (EMK), which maps local features to a low dimensional feature space, average the resulting feature vectors to form a set-level feature, then apply a linear classifier. The local feature maps are learned so that their inner products preserve, to the best possible, the values of the specified kernel function. EMK is linear both in the number of images and in the number of local features. We demonstrate that EMK is extremely efficient and achieves the current state of the art performance on three difficult real world datasets: Scene-15, Caltech-101 and Caltech-256.
Author Information
Liefeng Bo (Intel Labs)
Cristian Sminchisescu (Lund University/Google)
Related Events (a corresponding poster, oral, or spotlight)
-
2009 Poster: Efficient Match Kernel between Sets of Features for Visual Recognition »
Thu. Dec 10th 03:00 -- 07:59 AM Room
More from the Same Authors
-
2021 Spotlight: H-NeRF: Neural Radiance Fields for Rendering and Temporal Reconstruction of Humans in Motion »
Hongyi Xu · Thiemo Alldieck · Cristian Sminchisescu -
2021 Poster: Relative Flatness and Generalization »
Henning Petzka · Michael Kamp · Linara Adilova · Cristian Sminchisescu · Mario Boley -
2021 Poster: REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision »
Mihai Fieraru · Mihai Zanfir · Teodor Szente · Eduard Bazavan · Vlad Olaru · Cristian Sminchisescu -
2021 Poster: H-NeRF: Neural Radiance Fields for Rendering and Temporal Reconstruction of Humans in Motion »
Hongyi Xu · Thiemo Alldieck · Cristian Sminchisescu -
2012 Poster: Discriminatively Trained Sparse Code Gradients for Contour Detection »
Xiaofeng Ren · Liefeng Bo -
2012 Poster: Unsupervised template learning for fine-grained object recognition »
Shulin Yang · Liefeng Bo · Jue Wang · Linda Shapiro -
2012 Spotlight: Discriminatively Trained Sparse Code Gradients for Contour Detection »
Xiaofeng Ren · Liefeng Bo -
2011 Poster: Probabilistic Joint Image Segmentation and Labeling »
Adrian Ion · Joao Carreira · Cristian Sminchisescu -
2011 Spotlight: Probabilistic Joint Image Segmentation and Labeling »
Adrian Ion · Joao Carreira · Cristian Sminchisescu -
2011 Poster: Hierarchical Matching Pursuit for Recognition: Architecture and Fast Algorithms »
Liefeng Bo · Xiaofeng Ren · Dieter Fox -
2010 Spotlight: Kernel Descriptors for Visual Recognition »
Liefeng Bo · Xiaofeng Ren · Dieter Fox -
2010 Poster: Kernel Descriptors for Visual Recognition »
Liefeng Bo · Xiaofeng Ren · Dieter Fox -
2010 Poster: Convex Multiple-Instance Learning by Estimating Likelihood Ratio »
Fuxin Li · Cristian Sminchisescu -
2009 Poster: Conditional Neural Fields »
Jian Peng · Liefeng Bo · Jinbo Xu -
2007 Poster: People Tracking with the Laplacian Eigenmaps Latent Variable Model »
Zhengdong Lu · Miguel A. Carreira-Perpinan · Cristian Sminchisescu