This tutorial surveys methodology and theory for high-dimensional statistical inference when the number of variables or features greatly exceeds sample size. Particular emphasis will be placed on problems of model and feature selection. This includes variable selection in regression models or estimation of the edge set in graphical modeling. While the former is concerned with association, the latter can be used for causal analysis. In the high-dimensional setting, major challenges include designing computational algorithms that are feasible for large-scale problems, assigning statistical error rates (e.g., p-values), and developing theoretical insights about the limits of what is possible. We will present some of the most important recent developments and discuss their implications for prediction, association analysis and some exciting new directions in causal inference.
Peter Bühlmann (ETH Zurich)
Peter Bühlmann is Professor of Statistics at the ETH Z"urich. His research interests are in computational statistics, high-dimensional statistical inference, machine learning and applications in the life sciences. He is a Fellow of the Institute of Mathematical Statistics, an elected Member of the International Statistical Institute and he presented a Medallion Lecture at the JSM 2009. He has served as editorial member of the Journal of the Royal Statistical Society (Series B), Journal of Machine Learning Research, Biometrical Journal and he is currently the Editor of the Annals of Statistics.