Timezone: »
We develop a scoring and classification procedure based on the PAC-Bayesian approach and the AUC (Area Under Curve) criterion. We focus initially on the class of linear score functions. We derive PAC-Bayesian non-asymptotic bounds for two types of prior for the score parameters: a Gaussian prior, and a spike-and-slab prior; the latter makes it possible to perform feature selection. One important advantage of our approach is that it is amenable to powerful Bayesian computational tools. We derive in particular a Sequential Monte Carlo algorithm, as an efficient method which may be used as a gold standard, and an Expectation-Propagation algorithm, as a much faster but approximate method. We also extend our method to a class of non-linear score functions, essentially leading to a nonparametric procedure, by considering a Gaussian process prior.
Author Information
James Ridgway (Crest-Ensae and Dauphine)
Pierre Alquier (ENSAE)
Nicolas Chopin (CREST)
Feng Liang (Univ. of Illinois Urbana-Champaign)
More from the Same Authors
-
2019 Poster: Bayesian Joint Estimation of Multiple Graphical Models »
Lingrui Gan · Xinming Yang · Naveen Narisetty · Feng Liang -
2014 Poster: On a Theory of Nonparametric Pairwise Similarity for Clustering: Connecting Clustering to Classification »
Yingzhen Yang · Feng Liang · Shuicheng Yan · Zhangyang Wang · Thomas S Huang -
2009 Poster: Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models »
Jing Gao · Feng Liang · Wei Fan · Yizhou Sun · Jiawei Han