Timezone: »
In many learning problems, ranging from clustering to ranking through metric learning, empirical estimates of the risk functional consist of an average over tuples (e.g., pairs or triplets) of observations, rather than over individual observations. In this paper, we focus on how to best implement a stochastic approximation approach to solve such risk minimization problems. We argue that in the large-scale setting, gradient estimates should be obtained by sampling tuples of data points with replacement (incomplete U-statistics) instead of sampling data points without replacement (complete U-statistics based on subsamples). We develop a theoretical framework accounting for the substantial impact of this strategy on the generalization ability of the prediction model returned by the Stochastic Gradient Descent (SGD) algorithm. It reveals that the method we promote achieves a much better trade-off between statistical accuracy and computational cost. Beyond the rate bound analysis, experiments on AUC maximization and metric learning provide strong empirical evidence of the superiority of the proposed approach.
Author Information
Guillaume Papa (Telecom paristech)
Stéphan Clémençon (Telecom ParisTech)
Aurélien Bellet (Telecom ParisTech)
More from the Same Authors
-
2022 : Assessing Performance and Fairness Metrics in Face Recognition - Bootstrap Methods »
Jean-Rémy Conti · Stéphan Clémençon -
2017 Poster: Ranking Data with Continuous Labels through Oriented Recursive Partitions »
Stéphan Clémençon · Mastane Achab -
2016 Poster: On Graph Reconstruction via Empirical Risk Minimization: Fast Learning Rates and Scalability »
Guillaume Papa · Aurélien Bellet · Stephan Clémençon -
2015 Poster: Extending Gossip Algorithms to Distributed Estimation of U-statistics »
Igor Colin · Aurélien Bellet · Joseph Salmon · Stéphan Clémençon -
2015 Spotlight: Extending Gossip Algorithms to Distributed Estimation of U-statistics »
Igor Colin · Aurélien Bellet · Joseph Salmon · Stéphan Clémençon -
2011 Poster: On U-processes and clustering performance »
Stéphan Clémençon -
2011 Spotlight: On U-processes and clustering performance »
Stéphan Clémençon -
2009 Poster: AUC optimization and the two-sample problem »
Stéphan Clémençon · Nicolas Vayatis · Marine Depecker