Timezone: »
In this paper, we study the problem of fair sparse regression on a biased dataset where bias depends upon a hidden binary attribute. The presence of a hidden attribute adds an extra layer of complexity to the problem by combining sparse regression and clustering with unknown binary labels. The corresponding optimization problem is combinatorial, but we propose a novel relaxation of it as an invex optimization problem. To the best of our knowledge, this is the first invex relaxation for a combinatorial problem. We show that the inclusion of the debiasing/fairness constraint in our model has no adverse effect on the performance. Rather, it enables the recovery of the hidden attribute. The support of our recovered regression parameter vector matches exactly with the true parameter vector. Moreover, we simultaneously solve the clustering problem by recovering the exact value of the hidden attribute for each sample. Our method uses carefully constructed primal dual witnesses to provide theoretical guarantees for the combinatorial problem. To that end, we show that the sample complexity of our method is logarithmic in terms of the dimension of the regression parameter vector.
Author Information
Adarsh Barik (Purdue University)
Jean Honorio (Purdue University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Spotlight: Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem »
Dates n/a. Room None
More from the Same Authors
-
2021 Poster: Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees »
Gregory Dexter · Kevin Bello · Jean Honorio -
2020 Poster: Fairness constraints can help exact inference in structured prediction »
Kevin Bello · Jean Honorio -
2019 Poster: Learning Bayesian Networks with Low Rank Conditional Probability Tables »
Adarsh Barik · Jean Honorio -
2019 Poster: On the Correctness and Sample Complexity of Inverse Reinforcement Learning »
Abi Komanduru · Jean Honorio -
2019 Poster: Exact inference in structured prediction »
Kevin Bello · Jean Honorio -
2018 Poster: Learning latent variable structured prediction models with Gaussian perturbations »
Kevin Bello · Jean Honorio -
2018 Poster: Information-theoretic Limits for Community Detection in Network Models »
Chuyang Ke · Jean Honorio -
2018 Poster: Computationally and statistically efficient learning of causal Bayes nets using path queries »
Kevin Bello · Jean Honorio -
2017 Poster: Learning Identifiable Gaussian Bayesian Networks in Polynomial Time and Sample Complexity »
Asish Ghoshal · Jean Honorio