Timezone: »
Successful applications of InfoNCE (Information Noise-Contrastive Estimation) and its variants have popularized the use of contrastive variational mutual information (MI) estimators in machine learning . While featuring superior stability, these estimators crucially depend on costly large-batch training, and they sacrifice bound tightness for variance reduction. To overcome these limitations, we revisit the mathematics of popular variational MI bounds from the lens of unnormalized statistical modeling and convex optimization. Our investigation yields a new unified theoretical framework encompassing popular variational MI bounds, and leads to a novel, simple, and powerful contrastive MI estimator we name FLO. Theoretically, we show that the FLO estimator is tight, and it converges under stochastic gradient descent. Empirically, the proposed FLO estimator overcomes the limitations of its predecessors and learns more efficiently. The utility of FLO is verified using extensive benchmarks, and we further inspire the community with novel applications in meta-learning. Our presentation underscores the foundational importance of variational MI estimation in data-efficient learning.
Author Information
Qing Guo (Virginia Tech)
Junya Chen (Duke University)
Dong Wang (Duke University)
Yuewei Yang (Duke University)
Xinwei Deng (Virginia Tech)
Jing Huang (JD AI Research)
Larry Carin
Fan Li (Duke University)
Chenyang Tao (Amazon)
More from the Same Authors
-
2021 Spotlight: Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer »
Junya Chen · Zidi Xiu · Benjamin Goldstein · Ricardo Henao · Lawrence Carin · Chenyang Tao -
2022 : Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding »
Maximillian Chen · Alexandros Papangelis · Chenyang Tao · Andy Rosenbaum · Seokhwan Kim · Yang Liu · Zhou Yu · Dilek Hakkani-Tur -
2021 Poster: Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer »
Junya Chen · Zidi Xiu · Benjamin Goldstein · Ricardo Henao · Lawrence Carin · Chenyang Tao -
2020 Poster: Reconsidering Generative Objectives For Counterfactual Reasoning »
Danni Lu · Chenyang Tao · Junya Chen · Fan Li · Feng Guo · Lawrence Carin -
2019 Poster: Improving Textual Network Learning with Variational Homophilic Embeddings »
Wenlin Wang · Chenyang Tao · Zhe Gan · Guoyin Wang · Liqun Chen · Xinyuan Zhang · Ruiyi Zhang · Qian Yang · Ricardo Henao · Lawrence Carin -
2019 Poster: On Fenchel Mini-Max Learning »
Chenyang Tao · Liqun Chen · Shuyang Dai · Junya Chen · Ke Bai · Dong Wang · Jianfeng Feng · Wenlian Lu · Georgiy Bobashev · Lawrence Carin -
2018 Poster: Adversarial Text Generation via Feature-Mover's Distance »
Liqun Chen · Shuyang Dai · Chenyang Tao · Haichao Zhang · Zhe Gan · Dinghan Shen · Yizhe Zhang · Guoyin Wang · Dinghan Shen · Lawrence Carin -
2009 Oral: Non-Parametric Bayesian Dictionary Learning for Sparse Image Representations »
Mingyuan Zhou · Haojun Chen · John Paisley · Lu Ren · Guillermo Sapiro · Larry Carin