Timezone: »
Poster
$k$-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension
Ziv Goldfeld · Kristjan Greenewald · Theshani Nuradha · Galen Reeves
Sliced mutual information (SMI) is defined as an average of mutual information (MI) terms between one-dimensional random projections of the random variables. It serves as a surrogate measure of dependence to classic MI that preserves many of its properties but is more scalable to high dimensions. However, a quantitative characterization of how SMI itself and estimation rates thereof depend on the ambient dimension, which is crucial to the understanding of scalability, remain obscure. This work provides a multifaceted account of the dependence of SMI on dimension, under a broader framework termed $k$-SMI, which considers projections to $k$-dimensional subspaces. Using a new result on the continuity of differential entropy in the 2-Wasserstein metric, we derive sharp bounds on the error of Monte Carlo (MC)-based estimates of $k$-SMI, with explicit dependence on $k$ and the ambient dimension, revealing their interplay with the number of samples. We then combine the MC integrator with the neural estimation framework to provide an end-to-end $k$-SMI estimator, for which optimal convergence rates are established. We also explore asymptotics of the population $k$-SMI as dimension grows, providing Gaussian approximation results with a residual that decays under appropriate moment bounds. All our results trivially apply to SMI by setting $k=1$. Our theory is validated with numerical experiments and is applied to sliced InfoGAN, which altogether provide a comprehensive quantitative account of the scalability question of $k$-SMI, including SMI as a special case when $k=1$.
Author Information
Ziv Goldfeld (Cornell University)
Kristjan Greenewald (MIT-IBM Watson AI Lab; IBM Research)
Theshani Nuradha (Cornell University)
Galen Reeves (Duke University)
More from the Same Authors
-
2021 Spotlight: Sliced Mutual Information: A Scalable Measure of Statistical Dependence »
Ziv Goldfeld · Kristjan Greenewald -
2022 Poster: Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances »
Sloan Nietert · Ziv Goldfeld · Ritwik Sadhu · Kengo Kato -
2021 Poster: Sliced Mutual Information: A Scalable Measure of Statistical Dependence »
Ziv Goldfeld · Kristjan Greenewald -
2020 Poster: Asymptotic Guarantees for Generative Modeling Based on the Smooth Wasserstein Distance »
Ziv Goldfeld · Kristjan Greenewald · Kengo Kato -
2020 Poster: Active Structure Learning of Causal DAGs via Directed Clique Trees »
Chandler Squires · Sara Magliacane · Kristjan Greenewald · Dmitriy Katz · Murat Kocaoglu · Karthikeyan Shanmugam -
2020 Poster: Entropic Causal Inference: Identifiability and Finite Sample Results »
Spencer Compton · Murat Kocaoglu · Kristjan Greenewald · Dmitriy Katz -
2019 Poster: Statistical Model Aggregation via Parameter Matching »
Mikhail Yurochkin · Mayank Agarwal · Soumya Ghosh · Kristjan Greenewald · Nghia Hoang -
2019 Poster: Sample Efficient Active Learning of Causal Trees »
Kristjan Greenewald · Dmitriy Katz · Karthikeyan Shanmugam · Sara Magliacane · Murat Kocaoglu · Enric Boix-Adsera · Guy Bresler