Timezone: »
We present an approach for lifelong/continual learning of convolutional neural networks (CNN) that does not suffer from the problem of catastrophic forgetting when moving from one task to the other. We show that the activation maps generated by the CNN trained on the old task can be calibrated using very few calibration parameters, to become relevant to the new task. Based on this, we calibrate the activation maps produced by each network layer using spatial and channel-wise calibration modules and train only these calibration parameters for each new task in order to perform lifelong learning. Our calibration modules introduce significantly less computation and parameters as compared to the approaches that dynamically expand the network. Our approach is immune to catastrophic forgetting since we store the task-adaptive calibration parameters, which contain all the task-specific knowledge and is exclusive to each task. Further, our approach does not require storing data samples from the old tasks, which is done by many replay based methods. We perform extensive experiments on multiple benchmark datasets (SVHN, CIFAR, ImageNet, and MS-Celeb), all of which show substantial improvements over state-of-the-art methods (e.g., a 29% absolute increase in accuracy on CIFAR-100 with 10 classes at a time). On large-scale datasets, our approach yields 23.8% and 9.7% absolute increase in accuracy on ImageNet-100 and MS-Celeb-10K datasets, respectively, by employing very few (0.51% and 0.35% of model parameters) task-adaptive calibration parameters.
Author Information
Pravendra Singh (Indian Institute of Technology Kanpur)
Vinay Kumar Verma (Indian Institute of Technology Kanpur)
Pratik Mazumder (Indian Institute of Technology, Kanpur)
Lawrence Carin (Duke University)
Piyush Rai (IIT Kanpur)
More from the Same Authors
-
2021 Spotlight: Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer »
Junya Chen · Zidi Xiu · Benjamin Goldstein · Ricardo Henao · Lawrence Carin · Chenyang Tao -
2021 : VAEs meet Diffusion Models: Efficient and High-Fidelity Generation »
Kushagra Pandey · Avideep Mukherjee · Piyush Rai · Abhishek Kumar -
2022 : CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks »
Sakshi Varshney · Vinay Verma · Srijith PK · Piyush Rai · Lawrence Carin -
2021 : NeurInt-Learning Interpolation by Neural ODEs »
Avinandan Bose · Aniket Das · Yatin Dandi · Piyush Rai -
2021 : VAEs meet Diffusion Models: Efficient and High-Fidelity Generation »
Kushagra Pandey · Avideep Mukherjee · Piyush Rai · Abhishek Kumar -
2021 Poster: Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer »
Junya Chen · Zidi Xiu · Benjamin Goldstein · Ricardo Henao · Lawrence Carin · Chenyang Tao -
2021 Poster: CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks »
Sakshi Varshney · Vinay Kumar Verma · P. K. Srijith · Lawrence Carin · Piyush Rai -
2020 Poster: GAN Memory with No Forgetting »
Yulai Cong · Miaoyun Zhao · Jianqiao Li · Sijia Wang · Lawrence Carin -
2020 Poster: Reconsidering Generative Objectives For Counterfactual Reasoning »
Danni Lu · Chenyang Tao · Junya Chen · Fan Li · Feng Guo · Lawrence Carin -
2020 Poster: AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning »
Hao Zhang · Yuan Li · Zhijie Deng · Xiaodan Liang · Lawrence Carin · Eric Xing -
2020 Poster: Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability »
Nathan Inkawhich · Kevin J Liang · Binghui Wang · Matthew Inkawhich · Lawrence Carin · Yiran Chen -
2019 Poster: Improving Textual Network Learning with Variational Homophilic Embeddings »
Wenlin Wang · Chenyang Tao · Zhe Gan · Guoyin Wang · Liqun Chen · Xinyuan Zhang · Ruiyi Zhang · Qian Yang · Ricardo Henao · Lawrence Carin -
2019 Poster: Ouroboros: On Accelerating Training of Transformer-Based Language Models »
Qian Yang · Zhouyuan Huo · Wenlin Wang · Lawrence Carin -
2019 Poster: Scalable Gromov-Wasserstein Learning for Graph Partitioning and Matching »
Hongteng Xu · Dixin Luo · Lawrence Carin -
2019 Poster: Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods »
Kevin J Liang · Guoyin Wang · Yitong Li · Ricardo Henao · Lawrence Carin -
2019 Poster: Certified Adversarial Robustness with Additive Noise »
Bai Li · Changyou Chen · Wenlin Wang · Lawrence Carin -
2019 Poster: On Fenchel Mini-Max Learning »
Chenyang Tao · Liqun Chen · Shuyang Dai · Junya Chen · Ke Bai · Dong Wang · Jianfeng Feng · Wenlian Lu · Georgiy Bobashev · Lawrence Carin -
2018 Poster: Adversarial Text Generation via Feature-Mover's Distance »
Liqun Chen · Shuyang Dai · Chenyang Tao · Haichao Zhang · Zhe Gan · Dinghan Shen · Yizhe Zhang · Guoyin Wang · Dinghan Shen · Lawrence Carin -
2018 Poster: Distilled Wasserstein Learning for Word Embedding and Topic Modeling »
Hongteng Xu · Wenlin Wang · Wei Liu · Lawrence Carin -
2018 Poster: Diffusion Maps for Textual Network Embedding »
Xinyuan Zhang · Yitong Li · Dinghan Shen · Lawrence Carin -
2018 Spotlight: Diffusion Maps for Textual Network Embedding »
Xinyuan Zhang · Yitong Li · Dinghan Shen · Lawrence Carin -
2017 Spotlight: Targeting EEG/LFP Synchrony with Neural Nets »
Yitong Li · michael Murias · samantha Major · geraldine Dawson · Kafui Dzirasa · Lawrence Carin · David Carlson -
2017 Poster: Targeting EEG/LFP Synchrony with Neural Nets »
Yitong Li · michael Murias · samantha Major · geraldine Dawson · Kafui Dzirasa · Lawrence Carin · David Carlson -
2017 Poster: Triangle Generative Adversarial Networks »
Zhe Gan · Liqun Chen · Weiyao Wang · Yuchen Pu · Yizhe Zhang · Hao Liu · Chunyuan Li · Lawrence Carin -
2017 Poster: ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching »
Chunyuan Li · Hao Liu · Changyou Chen · Yuchen Pu · Liqun Chen · Ricardo Henao · Lawrence Carin -
2017 Poster: An inner-loop free solution to inverse problems using deep neural networks »
Kai Fan · Qi Wei · Lawrence Carin · Katherine Heller -
2017 Poster: VAE Learning via Stein Variational Gradient Descent »
Yuchen Pu · Zhe Gan · Ricardo Henao · Chunyuan Li · Shaobo Han · Lawrence Carin -
2017 Poster: Deconvolutional Paragraph Representation Learning »
Yizhe Zhang · Dinghan Shen · Guoyin Wang · Zhe Gan · Ricardo Henao · Lawrence Carin -
2017 Poster: Adversarial Symmetric Variational Autoencoder »
Yuchen Pu · Weiyao Wang · Ricardo Henao · Liqun Chen · Zhe Gan · Chunyuan Li · Lawrence Carin -
2017 Poster: A Probabilistic Framework for Nonlinearities in Stochastic Neural Networks »
Qinliang Su · xuejun Liao · Lawrence Carin -
2017 Poster: Scalable Model Selection for Belief Networks »
Zhao Song · Yusuke Muraoka · Ryohei Fujimaki · Lawrence Carin -
2017 Poster: Cross-Spectral Factor Analysis »
Neil Gallagher · Kyle Ulrich · Austin Talbot · Kafui Dzirasa · Lawrence Carin · David Carlson -
2016 Poster: Towards Unifying Hamiltonian Monte Carlo and Slice Sampling »
Yizhe Zhang · Xiangyu Wang · Changyou Chen · Ricardo Henao · Kai Fan · Lawrence Carin -
2016 Poster: Variational Autoencoder for Deep Learning of Images, Labels and Captions »
Yunchen Pu · Zhe Gan · Ricardo Henao · Xin Yuan · Chunyuan Li · Andrew Stevens · Lawrence Carin -
2016 Poster: Linear Feature Encoding for Reinforcement Learning »
Zhao Song · Ronald Parr · Xuejun Liao · Lawrence Carin -
2016 Poster: Stochastic Gradient MCMC with Stale Gradients »
Changyou Chen · Nan Ding · Chunyuan Li · Yizhe Zhang · Lawrence Carin -
2015 Poster: GP Kernels for Cross-Spectrum Analysis »
Kyle R Ulrich · David Carlson · Kafui Dzirasa · Lawrence Carin -
2015 Poster: Deep Poisson Factor Modeling »
Ricardo Henao · Zhe Gan · James Lu · Lawrence Carin -
2015 Poster: Preconditioned Spectral Descent for Deep Learning »
David Carlson · Edo Collins · Ya-Ping Hsieh · Lawrence Carin · Volkan Cevher -
2015 Poster: Large-Scale Bayesian Multi-Label Learning via Topic-Based Label Embeddings »
Piyush Rai · Changwei Hu · Ricardo Henao · Lawrence Carin -
2015 Spotlight: Large-Scale Bayesian Multi-Label Learning via Topic-Based Label Embeddings »
Piyush Rai · Changwei Hu · Ricardo Henao · Lawrence Carin -
2015 Poster: On the Convergence of Stochastic Gradient MCMC Algorithms with High-Order Integrators »
Changyou Chen · Nan Ding · Lawrence Carin -
2015 Poster: Deep Temporal Sigmoid Belief Networks for Sequence Modeling »
Zhe Gan · Chunyuan Li · Ricardo Henao · David Carlson · Lawrence Carin -
2014 Poster: Analysis of Brain States from Multi-Region LFP Time-Series »
Kyle R Ulrich · David Carlson · Wenzhao Lian · Jana S Borg · Kafui Dzirasa · Lawrence Carin -
2014 Poster: Bayesian Nonlinear Support Vector Machines and Discriminative Factor Modeling »
Ricardo Henao · Xin Yuan · Lawrence Carin -
2014 Poster: Compressive Sensing of Signals from a GMM with Sparse Precision Matrices »
Jianbo Yang · Xuejun Liao · Minhua Chen · Lawrence Carin -
2014 Poster: On the relations of LFPs & Neural Spike Trains »
David Carlson · Jana Schaich Borg · Kafui Dzirasa · Lawrence Carin -
2014 Poster: Dynamic Rank Factor Model for Text Streams »
Shaobo Han · Lin Du · Esther Salazar · Lawrence Carin -
2013 Poster: Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture »
Trevor Campbell · Miao Liu · Brian Kulis · Jonathan How · Lawrence Carin -
2013 Poster: Designed Measurements for Vector Count Data »
Liming Wang · David Carlson · Miguel Rodrigues · David Wilcox · Robert Calderbank · Lawrence Carin -
2013 Poster: Integrated Non-Factorized Variational Inference »
Shaobo Han · Xuejun Liao · Lawrence Carin -
2013 Poster: Real-Time Inference for a Gamma Process Model of Neural Spiking »
David Carlson · Vinayak Rao · Joshua T Vogelstein · Lawrence Carin -
2012 Workshop: Bayesian Nonparametric Models For Reliable Planning And Decision-Making Under Uncertainty »
Jonathan How · Lawrence Carin · John Fisher III · Michael Jordan · Alborz Geramifard -
2012 Poster: Joint Modeling of a Matrix with Associated Text via Latent Binary Features »
XianXing Zhang · Lawrence Carin -
2012 Poster: Augment-and-Conquer Negative Binomial Processes »
Mingyuan Zhou · Lawrence Carin -
2012 Spotlight: Augment-and-Conquer Negative Binomial Processes »
Mingyuan Zhou · Lawrence Carin -
2011 Poster: On the Analysis of Multi-Channel Neural Spike Data »
Bo Chen · David Carlson · Lawrence Carin -
2011 Poster: The Kernel Beta Process »
Lu Ren · Yingjian Wang · David B Dunson · Lawrence Carin -
2011 Spotlight: The Kernel Beta Process »
Lu Ren · Yingjian Wang · David B Dunson · Lawrence Carin -
2011 Poster: Hierarchical Topic Modeling for Analysis of Time-Evolving Personal Choices »
XianXing Zhang · David B Dunson · Lawrence Carin -
2010 Poster: Joint Analysis of Time-Evolving Binary Matrices and Associated Documents »
Eric X Wang · Dehong Liu · Jorge G Silva · David B Dunson · Lawrence Carin -
2009 Poster: A Bayesian Model for Simultaneous Image Clustering, Annotation and Object Segmentation »
Lan Du · Lu Ren · David B Dunson · Lawrence Carin -
2009 Poster: Non-Parametric Bayesian Dictionary Learning for Sparse Image Representations »
Mingyuan Zhou · Haojun Chen · John Paisley · Lu Ren · Guillermo Sapiro · Lawrence Carin -
2009 Poster: Learning to Explore and Exploit in POMDPs »
Chenghui Cai · Xuejun Liao · Lawrence Carin -
2008 Workshop: Cost Sensitive Learning »
Balaji R Krishnapuram · Shipeng Yu · Oksana Yakhnenko · R. Bharat Rao · Lawrence Carin -
2007 Poster: Semi-Supervised Multitask Learning »
Qiuhua Liu · Xuejun Liao · Lawrence Carin -
2007 Spotlight: Semi-Supervised Multitask Learning »
Qiuhua Liu · Xuejun Liao · Lawrence Carin