Timezone: »
One of the original goals of computer vision was to fully understand a natural scene. This requires solving several problems simultaneously, including object detection, labeling of meaningful regions, and 3d reconstruction. While great progress has been made in tackling each of these problems in isolation, only recently have researchers again been considering the difficult task of assembling various methods to the mutual benefit of all. We consider learning a set of such classification models in such a way that they both solve their own problem and help each other. We develop a framework known as Cascaded Classification Models (CCM), where repeated instantiations of these classifiers are coupled by their input/output variables in a cascade that improves performance at each level. Our method requires only a limited âblack boxâ interface with the models, allowing us to use very sophisticated, state-of-the-art classifiers without having to look under the hood. We demonstrate the effectiveness of our method on a large set of natural images by combining the subtasks of scene categorization, object detection, multiclass image segmentation, and 3d scene reconstruction.
Author Information
Geremy Heitz (Stanford University)
Stephen Gould (ANU)
Ashutosh Saxena (Cornell University)
Daphne Koller (insitro)
Daphne Koller is the Rajeev Motwani Professor of Computer Science at Stanford University and the co-founder and co-CEO of Coursera, a social entrepreneurship company that works with the best universities to connect anyone around the world with the best education, for free. Coursera is the leading MOOC (Massive Open Online Course) platform, and has partnered with dozens of the world’s top universities to offer hundreds of courses in a broad range of disciplines to millions of students, spanning every country in the world. In her research life, she works in the area of machine learning and probabilistic modeling, with applications to systems biology and personalized medicine. She is the author of over 200 refereed publications in venues that span a range of disciplines, and has given over 15 keynote talks at major conferences. She is the recipient of many awards, which include the Presidential Early Career Award for Scientists and Engineers (PECASE), the MacArthur Foundation Fellowship, the ACM/Infosys award, and membership in the US National Academy of Engineering. She is also an award winning teacher, who pioneered in her Stanford class many of the ideas that underlie the Coursera user experience. She received her BSc and MSc from the Hebrew University of Jerusalem, and her PhD from Stanford in 1994.
Related Events (a corresponding poster, oral, or spotlight)
-
2008 Oral: Cascaded Classification Models: Combining Models for Holistic Scene Understanding »
Thu. Dec 11th 05:10 -- 05:30 PM Room
More from the Same Authors
-
2021 : Regression modeling on DNA encoded libraries »
Ralph Ma · Gabriel Dreiman · Fiorella Ruggiu · Adam Riesselman · Bowen Liu · Mohammad M Sultan · Daphne Koller -
2023 Poster: Revisiting Implicit Differentiation for Learning Problems in Optimal Control »
Ming Xu · Timothy L. Molloy · Stephen Gould -
2022 Spotlight: Lightning Talks 6B-2 »
Alexander Korotin · Jinyuan Jia · Weijian Deng · Shi Feng · Maying Shen · Denizalp Goktas · Fang-Yi Yu · Alexander Kolesov · Sadie Zhao · Stephen Gould · Hongxu Yin · Wenjie Qu · Liang Zheng · Evgeny Burnaev · Amy Greenwald · Neil Gong · Pavlo Molchanov · Yiling Chen · Lei Mao · Jianna Liu · Jose M. Alvarez -
2022 Spotlight: On the Strong Correlation Between Model Invariance and Generalization »
Weijian Deng · Stephen Gould · Liang Zheng -
2022 Poster: On the Strong Correlation Between Model Invariance and Generalization »
Weijian Deng · Stephen Gould · Liang Zheng -
2021 Poster: Rethinking conditional GAN training: An approach using geometrically structured latent manifolds »
Sameera Ramasinghe · Moshiur Farazi · Salman H Khan · Nick Barnes · Stephen Gould -
2020 Poster: Language and Visual Entity Relationship Graph for Agent Navigation »
Yicong Hong · Cristian Rodriguez · Yuankai Qi · Qi Wu · Stephen Gould -
2019 : In conversations: Daphne Koller and Barbara Englehardt »
Daphne Koller · Barbara Engelhardt -
2018 Poster: Partially-Supervised Image Captioning »
Peter Anderson · Stephen Gould · Mark Johnson -
2013 Poster: Learning Trajectory Preferences for Manipulators via Iterative Improvement »
Ashesh Jain · Brian Wojcik · Thorsten Joachims · Ashutosh Saxena -
2013 Invited Talk: The Online Revolution: Learning without Limits »
Daphne Koller -
2012 Poster: Shifting Weights: Adapting Object Detectors from Image to Video »
Kevin Tang · Vignesh Ramanathan · Li Fei-Fei · Daphne Koller -
2011 Poster: $\theta$-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding »
Congcong Li · Ashutosh Saxena · Tsuhan Chen -
2011 Poster: Active Classification based on Value of Classifier »
Tianshi Gao · Daphne Koller -
2011 Spotlight: Active Classification based on Value of Classifier »
Tianshi Gao · Daphne Koller -
2011 Poster: Semantic Labeling of 3D Point Clouds for Indoor Scenes »
Hema Koppula · Abhishek Anand · Thorsten Joachims · Ashutosh Saxena -
2010 Poster: Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models »
Congcong Li · Adarsh P Kowdle · Ashutosh Saxena · Tsuhan Chen -
2010 Poster: Self-Paced Learning for Latent Variable Models »
M. Pawan Kumar · Benjamin D Packer · Daphne Koller -
2009 Poster: Region-based Segmentation and Object Detection »
Stephen Gould · Tianshi Gao · Daphne Koller -
2009 Spotlight: Region-based Segmentation and Object Detection »
Stephen Gould · Tianshi Gao · Daphne Koller -
2009 Poster: Learning a Small Mixture of Trees »
M. Pawan Kumar · Daphne Koller -
2008 Poster: Learning Bounded Treewidth Bayesian Networks »
Gal Elidan · Stephen Gould -
2008 Demonstration: High-Accuracy 3D Sensing for Mobile Manipulators »
Stephen Gould · Morgan Quigley · Siddarth Batra · Ellen Klingbiel · Quoc V Le · Andrew Y Ng -
2008 Spotlight: Learning Bounded Treewidth Bayesian Networks »
Gal Elidan · Stephen Gould -
2008 Poster: LOOPS: Localizing Object Outlines using Probabilistic Shape »
Geremy Heitz · Gal Elidan · Benjamin D Packer · Daphne Koller -
2007 Demonstration: Holistic Scene Understanding from Visual and Range Data »
Stephen Gould · Morgan Quigley · Andrew Y Ng · Daphne Koller -
2007 Demonstration: Building a 3-D Model From a Single Still Image »
Ashutosh Saxena · min sun · Andrew Y Ng -
2006 Poster: Max-margin classification of incomplete data »
Gal Chechik · Geremy Heitz · Gal Elidan · Pieter Abbeel · Daphne Koller -
2006 Poster: Temporal and Cross-Subject Probabilistic Models for fMRI Prediction Task »
Alexis Battle · Gal Chechik · Daphne Koller -
2006 Demonstration: Peripheral-Foveal Vision for Real-time Object Recognition »
Benjamin Sapp · Stephen Gould · Adrian Kaehler · Gary R Bradski · Andrew Y Ng -
2006 Spotlight: Max-margin classification of incomplete data »
Gal Chechik · Geremy Heitz · Gal Elidan · Pieter Abbeel · Daphne Koller -
2006 Talk: Temporal and Cross-Subject Probabilistic Models for fMRI Prediction Task »
Alexis Battle · Gal Chechik · Daphne Koller -
2006 Poster: Robotic Grasping of Novel Objects »
Ashutosh Saxena · Justin Driemeyer · Justin Kearns · Andrew Y Ng -
2006 Poster: Using Combinatorial Optimization within Max-Product Belief Propagation »
John Duchi · Danny Tarlow · Gal Elidan · Daphne Koller -
2006 Spotlight: Using Combinatorial Optimization within Max-Product Belief Propagation »
John Duchi · Danny Tarlow · Gal Elidan · Daphne Koller -
2006 Spotlight: Robotic Grasping of Novel Objects »
Ashutosh Saxena · Justin Driemeyer · Justin Kearns · Andrew Y Ng -
2006 Poster: Efficient Structure Learning of Markov Networks using L1-Regularization »
Su-In Lee · Varun Ganapathi · Daphne Koller