Timezone: »
Clustering is often formulated as a discrete optimization problem. The objective is to find, among all partitions of the data set, the best one according to some quality measure. However, in the statistical setting where we assume that the finite data set has been sampled from some underlying space, the goal is not to find the best partition of the given sample, but to approximate the true partition of the underlying space. We argue that the discrete optimization approach usually does not achieve this goal. As an alternative, we suggest the paradigm of ``nearest neighbor clustering''. Instead of selecting the best out of all partitions of the sample, it only considers partitions in some restricted function class. Using tools from statistical learning theory we prove that nearest neighbor clustering is statistically consistent. Moreover, its worst case complexity is polynomial by construction, and it can be implemented with small average case complexity using branch and bound.
Author Information
Ulrike von Luxburg (University of Tuebingen)
Sebastien Bubeck (MSR)
Stefanie S Jegelka (Max Planck Institute for Intelligent Systems)
Michael Kaufmann
Related Events (a corresponding poster, oral, or spotlight)
-
2007 Poster: Consistent Minimization of Clustering Objective Functions »
Tue. Dec 4th 06:30 -- 06:40 PM Room
More from the Same Authors
-
2022 Poster: Interpolation and Regularization for Causal Learning »
Leena Chennuru Vankadara · Luca Rendsburg · Ulrike Luxburg · Debarghya Ghoshdastidar -
2019 Poster: Foundations of Comparison-Based Hierarchical Clustering »
Debarghya Ghoshdastidar · Michaël Perrot · Ulrike von Luxburg -
2018 Poster: When do random forests fail? »
Cheng Tang · Damien Garreau · Ulrike von Luxburg -
2018 Poster: Measures of distortion for machine learning »
Leena Chennuru Vankadara · Ulrike von Luxburg -
2018 Poster: Practical Methods for Graph Two-Sample Testing »
Debarghya Ghoshdastidar · Ulrike von Luxburg -
2017 : Ordinal distance comparisons: from topology to geometry »
Ulrike von Luxburg -
2017 Poster: Kernel functions based on triplet comparisons »
Matthäus Kleindessner · Ulrike von Luxburg -
2015 Poster: Finite-Time Analysis of Projected Langevin Monte Carlo »
Sebastien Bubeck · Ronen Eldan · Joseph Lehec -
2013 Workshop: Learning Faster From Easy Data »
Peter Grünwald · Wouter M Koolen · Sasha Rakhlin · Nati Srebro · Alekh Agarwal · Karthik Sridharan · Tim van Erven · Sebastien Bubeck -
2013 Workshop: Bayesian Optimization in Theory and Practice »
Matthew Hoffman · Jasper Snoek · Nando de Freitas · Michael A Osborne · Ryan Adams · Sebastien Bubeck · Philipp Hennig · Remi Munos · Andreas Krause -
2013 Poster: Density estimation from unweighted k-nearest neighbor graphs: a roadmap »
Ulrike von Luxburg · Morteza Alamgir -
2013 Poster: Prior-free and prior-dependent regret bounds for Thompson Sampling »
Sebastien Bubeck · Che-Yu Liu -
2012 Session: Oral Session 2 »
Sebastien Bubeck -
2011 Workshop: Discrete Optimization in Machine Learning (DISCML): Uncertainty, Generalization and Feedback »
Andreas Krause · Pradeep Ravikumar · Stefanie S Jegelka · Jeffrey A Bilmes -
2011 Workshop: Relations between machine learning problems - an approach to unify the field »
Robert Williamson · John Langford · Ulrike von Luxburg · Mark Reid · Jennifer Wortman Vaughan -
2011 Poster: Phase transition in the family of p-resistances »
Morteza Alamgir · Ulrike von Luxburg -
2011 Spotlight: Phase transition in the family of p-resistances »
Morteza Alamgir · Ulrike von Luxburg -
2011 Poster: Multi-Bandit Best Arm Identification »
Victor Gabillon · Mohammad Ghavamzadeh · Alessandro Lazaric · Sebastien Bubeck -
2010 Spotlight: Getting lost in space: Large sample analysis of the resistance distance »
Ulrike von Luxburg · Agnes Radl · Matthias Hein -
2010 Poster: Getting lost in space: Large sample analysis of the resistance distance »
Ulrike von Luxburg · Agnes Radl · Matthias Hein -
2009 Workshop: Clustering: Science or art? Towards principled approaches »
Margareta Ackerman · Shai Ben-David · Avrim Blum · Isabelle Guyon · Ulrike von Luxburg · Robert Williamson · Reza Zadeh -
2008 Poster: Online Optimization in X-Armed Bandits »
Sebastien Bubeck · Remi Munos · Gilles Stoltz · Csaba Szepesvari -
2008 Poster: Influence of graph construction on graph-based clustering measures »
Markus M Maier · Ulrike von Luxburg · Matthias Hein -
2008 Oral: Influence of graph construction on graph-based clustering measures »
Markus M Maier · Ulrike von Luxburg · Matthias Hein -
2007 Session: Spotlights »
Ulrike von Luxburg -
2007 Session: Spotlights »
Ulrike von Luxburg