Timezone: »
Non-negative matrix factorization is a popular tool for decomposing data into feature and weight matrices under non-negativity constraints. It enjoys practical success but is poorly understood theoretically. This paper proposes an algorithm that alternates between decoding the weights and updating the features, and shows that assuming a generative model of the data, it provably recovers the ground-truth under fairly mild conditions. In particular, its only essential requirement on features is linear independence. Furthermore, the algorithm uses ReLU to exploit the non-negativity for decoding the weights, and thus can tolerate adversarial noise that can potentially be as large as the signal, and can tolerate unbiased noise much larger than the signal. The analysis relies on a carefully designed coupling between two potential functions, which we believe is of independent interest.
Author Information
Yuanzhi Li (Princeton University)
Yingyu Liang (Princeton University)
Andrej Risteski (Princeton University)
More from the Same Authors
-
2021 Poster: Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles »
Jiefeng Chen · Frederick Liu · Besim Avci · Xi Wu · Yingyu Liang · Somesh Jha -
2017 Poster: Convergence Analysis of Two-layer Neural Networks with ReLU Activation »
Yuanzhi Li · Yang Yuan -
2017 Poster: Linear Convergence of a Frank-Wolfe Type Algorithm over Trace-Norm Balls »
Zeyuan Allen-Zhu · Elad Hazan · Wei Hu · Yuanzhi Li -
2017 Spotlight: Linear Convergence of a Frank-Wolfe Type Algorithm over Trace-Norm Balls »
Zeyuan Allen-Zhu · Elad Hazan · Wei Hu · Yuanzhi Li -
2016 Poster: Approximate maximum entropy principles via Goemans-Williamson with applications to provable variational methods »
Andrej Risteski · Yuanzhi Li -
2016 Poster: Even Faster SVD Decomposition Yet Without Agonizing Pain »
Zeyuan Allen-Zhu · Yuanzhi Li -
2016 Poster: Algorithms and matching lower bounds for approximately-convex optimization »
Andrej Risteski · Yuanzhi Li -
2015 Poster: Scale Up Nonlinear Component Analysis with Doubly Stochastic Gradients »
Bo Xie · Yingyu Liang · Le Song -
2015 Poster: On some provably correct cases of variational inference for topic models »
Pranjal Awasthi · Andrej Risteski -
2015 Spotlight: On some provably correct cases of variational inference for topic models »
Pranjal Awasthi · Andrej Risteski -
2014 Poster: Improved Distributed Principal Component Analysis »
Yingyu Liang · Maria-Florina F Balcan · Vandana Kanchanapally · David Woodruff -
2014 Poster: Learning Time-Varying Coverage Functions »
Nan Du · Yingyu Liang · Maria-Florina F Balcan · Le Song -
2014 Poster: Scalable Kernel Methods via Doubly Stochastic Gradients »
Bo Dai · Bo Xie · Niao He · Yingyu Liang · Anant Raj · Maria-Florina F Balcan · Le Song -
2013 Poster: Distributed k-means and k-median clustering on general communication topologies »
Maria-Florina F Balcan · Steven Ehrlich · Yingyu Liang