Timezone: »
Poster
Outlier-Robust Wasserstein DRO
Sloan Nietert · Ziv Goldfeld · Soroosh Shafiee
Distributionally robust optimization (DRO) is an effective approach for data-driven decision-making in the presence of uncertainty. Geometric uncertainty due to~sampling or localized perturbations of data points is captured by Wasserstein DRO (WDRO), which seeks to learn a model that performs uniformly well over a Wasserstein ball centered around the observed data distribution. However, WDRO fails to account for non-geometric perturbations such as adversarial outliers, which can greatly distort the Wasserstein distance measurement and impede the learned model. We address this gap by proposing a novel outlier-robust WDRO framework for decision-making under both geometric (Wasserstein) perturbations and non-geometric (total variation (TV)) contamination that allows an $\varepsilon$-fraction of data to be arbitrarily corrupted. We design an uncertainty set using a certain robust Wasserstein ball that accounts for both perturbation types and derive minimax optimal excess risk bounds for this procedure that explicitly capture the Wasserstein and TV risks. We prove a strong duality result that enables tractable convex reformulations and efficient computation of our outlier-robust WDRO problem. When the loss function depends only on low-dimensional features of the data, we eliminate certain dimension dependencies from the risk bounds that are unavoidable in the general setting. Finally, we present experiments validating our theory on standard regression and classification tasks.
Author Information
Sloan Nietert (Cornell University)
Ziv Goldfeld (Cornell University)
Soroosh Shafiee (Cornell University)
More from the Same Authors
-
2021 Spotlight: Sliced Mutual Information: A Scalable Measure of Statistical Dependence »
Ziv Goldfeld · Kristjan Greenewald -
2023 : Entropic Gromov-Wasserstein Distances: Stability and Algorithms »
Gabriel Rioux · Ziv Goldfeld · Kengo Kato -
2023 : Semi-discrete Gromov-Wasserstein distances: Existence of Gromov-Monge Maps and Statistical Theory »
Gabriel Rioux · Ziv Goldfeld · Kengo Kato -
2023 : Outlier-Robust Wasserstein DRO »
Sloan Nietert · Ziv Goldfeld · Soroosh Shafiee -
2023 : Duality and Sample Complexity for the Gromov-Wasserstein Distance »
Zhengxin Zhang · Ziv Goldfeld · Youssef Mroueh · Bharath Sriperumbudur -
2023 Workshop: Optimal Transport and Machine Learning »
Anna Korba · Aram-Alexandre Pooladian · Charlotte Bunne · David Alvarez-Melis · Marco Cuturi · Ziv Goldfeld -
2023 : Information-Theoretic Generalization Error Bound of Deep Neural Networks »
Haiyun He · Christina Yu · Ziv Goldfeld -
2023 : Information-Theoretic Generalization Error Bound of Deep Neural Networks »
Haiyun He · Christina Yu · Ziv Goldfeld -
2023 Poster: Max-Sliced Mutual Information »
Dor Tsur · Ziv Goldfeld · Kristjan Greenewald -
2022 Poster: $k$-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension »
Ziv Goldfeld · Kristjan Greenewald · Theshani Nuradha · Galen Reeves -
2022 Poster: Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances »
Sloan Nietert · Ziv Goldfeld · Ritwik Sadhu · Kengo Kato -
2021 Poster: Sliced Mutual Information: A Scalable Measure of Statistical Dependence »
Ziv Goldfeld · Kristjan Greenewald -
2020 Poster: Asymptotic Guarantees for Generative Modeling Based on the Smooth Wasserstein Distance »
Ziv Goldfeld · Kristjan Greenewald · Kengo Kato