NeurIPS 2020: Multi-label Contrastive Predictive Coding



Multi-label Contrastive Predictive Coding

Jiaming Song, Stefano Ermon

Oral presentation: Orals & Spotlights Track 01: Representation/Relational
on Tue, Dec 8th, 2020 @ 02:15 – 02:30 GMT

Poster Session 1
on Tue, Dec 8th, 2020 @ 05:00 – 07:00 GMT


Abstract: Variational mutual information (MI) estimators are widely used in unsupervised representation learning methods such as contrastive predictive coding (CPC). A lower bound on MI can be obtained from a multi-class classification problem, where a critic attempts to distinguish a positive sample drawn from the underlying joint distribution from (m - 1) negative samples drawn from a suitable proposal distribution. With this approach, MI estimates are bounded above by log m and can therefore severely underestimate the true MI unless m is very large. To overcome this limitation, we introduce a novel estimator based on a multi-label classification problem, where the critic must jointly identify multiple positive samples at the same time. We show that, using the same number of negative samples, multi-label CPC can exceed the log m bound while remaining a valid lower bound on mutual information. We demonstrate that the proposed approach leads to better mutual information estimation, yields empirical improvements in unsupervised representation learning, and beats the current state of the art in knowledge distillation on 10 out of 13 tasks.
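The log m ceiling mentioned in the abstract can be seen in a small numerical sketch of the standard multi-class (CPC-style) estimate. This is an illustration only, not the authors' code: the function name `info_nce_estimate`, the convention that column 0 holds the positive sample, and the synthetic critic scores are all assumptions made for the example. It does not implement the multi-label estimator proposed in the paper.

```python
import numpy as np


def info_nce_estimate(scores):
    """Multi-class (CPC-style) MI estimate from critic log-scores.

    scores: array of shape (batch, m). By assumption for this sketch,
    column 0 is the positive sample and the other m - 1 columns are negatives.
    """
    scores = np.asarray(scores, dtype=float)
    m = scores.shape[1]
    # Numerically stable log-softmax over the m candidates in each row.
    shifted = scores - scores.max(axis=1, keepdims=True)
    log_softmax = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    # Mean log-probability assigned to the positive, offset by log m.
    # The log-probability is never positive, so the result is at most log m.
    return float(np.log(m) + log_softmax[:, 0].mean())


# Synthetic scores (hypothetical): a critic that almost always picks the positive.
rng = np.random.default_rng(0)
m = 16
scores = rng.normal(size=(1000, m))
scores[:, 0] += 50.0

estimate = info_nce_estimate(scores)
ceiling = float(np.log(m))
print(f"estimate = {estimate:.4f}, log m = {ceiling:.4f}")
```

Even with a near-perfect critic the estimate saturates at log m, so a true MI larger than log m cannot be recovered without increasing m. A critic that scores every candidate equally gives an estimate of 0.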
