Track: Orals & Spotlights Track 10: Social/Privacy

Tue 8 Dec. 6:00 - 6:15 PST

Oral

Adversarially Robust Streaming Algorithms via Differential Privacy

Avinatan Hassidim ⋅ Haim Kaplan ⋅ Yishay Mansour ⋅ Yossi Matias ⋅ Uri Stemmer

A streaming algorithm is said to be adversarially robust if its accuracy guarantees are maintained even when the data stream is chosen maliciously, by an adaptive adversary. We establish a connection between adversarial robustness of streaming algorithms and the notion of differential privacy. This connection allows us to design new adversarially robust streaming algorithms that outperform the current state-of-the-art constructions for many interesting regimes of parameters.

Tue 8 Dec. 6:15 - 6:30 PST

Oral

Differentially Private Clustering: Tight Approximation Ratios

Badih Ghazi ⋅ Ravi Kumar ⋅ Pasin Manurangsi

We study the task of differentially private clustering. For several basic clustering problems, including Euclidean DensestBall, 1-Cluster, k-means, and k-median, we give efficient differentially private algorithms that achieve essentially the same approximation ratios as those that can be obtained by any non-private algorithm, while incurring only small additive errors. This improves upon existing efficient algorithms that only achieve some large constant approximation factors.

Our results also imply an improved algorithm for the Sample and Aggregate privacy framework. Furthermore, we show that one of the tools used in our 1-Cluster algorithm can be employed to get a faster quantum algorithm for ClosestPair in a moderate number of dimensions.

Tue 8 Dec. 6:30 - 6:45 PST

Oral

Locally private non-asymptotic testing of discrete distributions is faster using interactive mechanisms

Thomas Berrett ⋅ Cristina Butucea

We find separation rates for testing multinomial or more general discrete distributions under the constraint of alpha-local differential privacy. We construct efficient randomized algorithms and test procedures, in both the case where only non-interactive privacy mechanisms are allowed and also in the case where all sequentially interactive privacy mechanisms are allowed. The separation rates are faster in the latter case. We prove general information theoretical bounds that allow us to establish the optimality of our algorithms among all pairs of privacy mechanisms and test procedures, in most usual cases. Considered examples include testing uniform, polynomially and exponentially decreasing distributions.

Tue 8 Dec. 6:45 - 7:00 PST

Break

Tue 8 Dec. 7:00 - 7:10 PST

Spotlight

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates

Wenhao Luo ⋅ Wen Sun ⋅ Ashish Kapoor

Safety in terms of collision avoidance for multi-robot systems is a difficult challenge under uncertainty, non-determinism, and lack of complete information. This paper aims to propose a collision avoidance method that accounts for both measurement uncertainty and motion uncertainty. In particular, we propose Probabilistic Safety Barrier Certificates (PrSBC) using Control Barrier Functions to define the space of admissible control actions that are probabilistically safe with formally provable theoretical guarantee. By formulating the chance constrained safety set into deterministic control constraints with PrSBC, the method entails minimally modifying an existing controller to determine an alternative safe controller via quadratic programming constrained to PrSBC constraints. The key advantage of the approach is that no assumptions about the form of uncertainty are required other than finite support, also enabling worst-case guarantees. We demonstrate effectiveness of the approach through experiments on realistic simulation environments.

Tue 8 Dec. 7:10 - 7:20 PST

Spotlight

Private Identity Testing for High-Dimensional Distributions

Clément L Canonne ⋅ Gautam Kamath ⋅ Audra McMillan ⋅ Jonathan Ullman ⋅ Lydia Zakynthinou

In this work we present novel differentially private identity (goodness-of-fit) testers for natural and widely studied classes of multivariate product distributions: Gaussians in R^d with known covariance and product distributions over {\pm 1}^d. Our testers have improved sample complexity compared to those derived from previous techniques, and are the first testers whose sample complexity matches the order-optimal minimax sample complexity of O(d^1/2/alpha^2) in many parameter regimes. We construct two types of testers, exhibiting tradeoffs between sample complexity and computational complexity. Finally, we provide a two-way reduction between testing a subclass of multivariate product distributions and testing univariate distributions, and thereby obtain upper and lower bounds for testing this subclass of product distributions.

Tue 8 Dec. 7:20 - 7:30 PST

Spotlight

Permute-and-Flip: A new mechanism for differentially private selection

Ryan McKenna ⋅ Daniel Sheldon

We consider the problem of differentially private selection. Given a finite set of candidate items, and a quality score for each item, our goal is to design a differentially private mechanism that returns an item with a score that is as high as possible. The most commonly used mechanism for this task is the exponential mechanism. In this work, we propose a new mechanism for this task based on a careful analysis of the privacy constraints. The expected score of our mechanism is always at least as large as the exponential mechanism, and can offer improvements up to a factor of two. Our mechanism is simple to implement and runs in linear time.

Tue 8 Dec. 7:30 - 7:40 PST

Spotlight

Smoothed Analysis of Online and Differentially Private Learning

Nika Haghtalab ⋅ Tim Roughgarden ⋅ Abhishek Shetty

Practical and pervasive needs for robustness and privacy in algorithms have inspired the design of online adversarial and differentially private learning algorithms. The primary quantity that characterizes learnability in these settings is the Littlestone dimension of the class of hypotheses [Ben-David et al., 2009, Alon et al., 2019]. This characterization is often interpreted as an impossibility result because classes such as linear thresholds and neural networks have infinite Littlestone dimension. In this paper, we apply the framework of smoothed analysis [Spielman and Teng, 2004], in which adversarially chosen inputs are perturbed slightly by nature. We show that fundamentally stronger regret and error guarantees are possible with smoothed adversaries than with worst-case adversaries. In particular, we obtain regret and privacy error bounds that depend only on the VC dimension and the bracketing number of a hypothesis class, and on the magnitudes of the perturbations.

Tue 8 Dec. 7:40 - 7:50 PST

Q&A

Joint Q&A for Preceeding Spotlights

Tue 8 Dec. 7:50 - 8:00 PST

Spotlight

Optimal Private Median Estimation under Minimal Distributional Assumptions

Christos Tzamos ⋅ Emmanouil-Vasileios Vlatakis-Gkaragkounis ⋅ Ilias Zadik

We study the fundamental task of estimating the median of an underlying distribution from a finite number of samples, under pure differential privacy constraints. We focus on distributions satisfying the minimal assumption that they have a positive density at a small neighborhood around the median. In particular, the distribution is allowed to output unbounded values and is not required to have finite moments. We compute the exact, up-to-constant terms, statistical rate of estimation for the median by providing nearly-tight upper and lower bounds. Furthermore, we design a polynomial-time differentially private algorithm which provably achieves the optimal performance. At a technical level, our results leverage a Lipschitz Extension Lemma which allows us to design and analyze differentially private algorithms solely on appropriately defined ``typical" instances of the samples.

Tue 8 Dec. 8:00 - 8:10 PST

Spotlight

Assisted Learning: A Framework for Multi-Organization Learning

Xun Xian ⋅ Xinran Wang ⋅ Jie Ding ⋅ Reza Ghanadan

In an increasing number of AI scenarios, collaborations among different organizations or agents (e.g., human and robots, mobile units) are often essential to accomplish an organization-specific mission. However, to avoid leaking useful and possibly proprietary information, organizations typically enforce stringent security constraints on sharing modeling algorithms and data, which significantly limits collaborations. In this work, we introduce the Assisted Learning framework for organizations to assist each other in supervised learning tasks without revealing any organization's algorithm, data, or even task. An organization seeks assistance by broadcasting task-specific but nonsensitive statistics and incorporating others' feedback in one or more iterations to eventually improve its predictive performance. Theoretical and experimental studies, including real-world medical benchmarks, show that Assisted Learning can often achieve near-oracle learning performance as if data and training processes were centralized.

Tue 8 Dec. 8:10 - 8:20 PST

Spotlight

Higher-Order Certification For Randomized Smoothing

Jeet Mohapatra ⋅ Ching-Yun Ko ⋅ Tsui-Wei Weng ⋅ Pin-Yu Chen ⋅ Sijia Liu ⋅ Luca Daniel

Randomized smoothing is a recently proposed defense against adversarial attacks that has achieved state-of-the-art provable robustness against $\ell_2$ perturbations. A number of works have extended the guarantees to other metrics, such as $\ell_1$ or $\ell_\infty$, by using different smoothing measures. Although the current framework has been shown to yield near-optimal $\ell_p$ radii, the total safety region certified by the current framework can be arbitrarily small compared to the optimal. In this work, we propose a framework to improve the certified safety region for these smoothed classifiers without changing the underlying smoothing scheme. The theoretical contributions are as follows: 1) We generalize the certification for randomized smoothing by reformulating certified radius calculation as a nested optimization problem over a class of functions. 2) We provide a method to calculate the certified safety region using zeroth-order and first-order information for Gaussian-smoothed classifiers. We also provide a framework that generalizes the calculation for certification using higher-order information. 3) We design efficient, high-confidence estimators for the relevant statistics of the first-order information. Combining the theoretical contribution 2) and 3) allows us to certify safety region that are significantly larger than ones provided by the current methods. On CIFAR and Imagenet, the new regions achieve significant improvements on general $\ell_1$ certified radii and on the $\ell_2$ certified radii for color-space attacks ($\ell_2$ perturbation restricted to only one color/channel) while also achieving smaller improvements on the general $\ell_2$ certified radii. As discussed in the future works section, our framework can also provide a way to circumvent the current impossibility results on achieving higher magnitudes of certified radii without requiring the use of data-dependent smoothing techniques.

Tue 8 Dec. 8:20 - 8:30 PST

Q&A

Joint Q&A for Preceeding Spotlights

Tue 8 Dec. 8:30 - 9:00 PST

Break

Main Navigation

Session

Orals & Spotlights Track 10: Social/Privacy

Yanan Sui ⋅ Aurélien Bellet

Adversarially Robust Streaming Algorithms via Differential Privacy

Differentially Private Clustering: Tight Approximation Ratios

Locally private non-asymptotic testing of discrete distributions is faster using interactive mechanisms

Break

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates

Private Identity Testing for High-Dimensional Distributions

Permute-and-Flip: A new mechanism for differentially private selection

Smoothed Analysis of Online and Differentially Private Learning

Joint Q&A for Preceeding Spotlights

Optimal Private Median Estimation under Minimal Distributional Assumptions

Assisted Learning: A Framework for Multi-Organization Learning

Higher-Order Certification For Randomized Smoothing

Joint Q&A for Preceeding Spotlights

Break