Timezone: »
For most scene understanding tasks (such as object detection or depth estimation), the classifiers need to consider contextual information in addition to the local features. We can capture such contextual information by taking as input the features/attributes from all the regions in the image. However, this contextual dependence also varies with the spatial location of the region of interest, and we therefore need a different set of parameters for each spatial location. This results in a very large number of parameters. In this work, we model the independence properties between the parameters for each location and for each task, by defining a Markov Random Field (MRF) over the parameters. In particular, two sets of parameters are encouraged to have similar values if they are spatially close or semantically close. Our method is, in principle, complementary to other ways of capturing context such as the ones that use a graphical model over the labels instead. In extensive evaluation over two different settings, of multi-class object detection and of multiple scene understanding tasks (scene categorization, depth estimation, geometric labeling), our method beats the state-of-the-art methods in all the four tasks.
Author Information
Congcong Li (Waymo)
Ashutosh Saxena (Cornell University)
Tsuhan Chen (Cornell University)
More from the Same Authors
-
2013 Poster: Learning Trajectory Preferences for Manipulators via Iterative Improvement »
Ashesh Jain · Brian Wojcik · Thorsten Joachims · Ashutosh Saxena -
2011 Poster: Semantic Labeling of 3D Point Clouds for Indoor Scenes »
Hema Koppula · Abhishek Anand · Thorsten Joachims · Ashutosh Saxena -
2010 Poster: Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models »
Congcong Li · Adarsh P Kowdle · Ashutosh Saxena · Tsuhan Chen -
2008 Oral: Cascaded Classification Models: Combining Models for Holistic Scene Understanding »
Geremy Heitz · Stephen Gould · Ashutosh Saxena · Daphne Koller -
2008 Poster: Cascaded Classification Models: Combining Models for Holistic Scene Understanding »
Geremy Heitz · Stephen Gould · Ashutosh Saxena · Daphne Koller -
2007 Demonstration: Building a 3-D Model From a Single Still Image »
Ashutosh Saxena · min sun · Andrew Y Ng -
2006 Poster: Robotic Grasping of Novel Objects »
Ashutosh Saxena · Justin Driemeyer · Justin Kearns · Andrew Y Ng -
2006 Spotlight: Robotic Grasping of Novel Objects »
Ashutosh Saxena · Justin Driemeyer · Justin Kearns · Andrew Y Ng