We propose an algorithm that compresses the critical information of a large dataset into compact addressable memories. These memories can then be recalled to quickly re-train a neural network and recover the performance (instead of storing and re-training on the full original dataset). Building upon the dataset distillation framework, we make a key observation that a shared common representation allows for more efficient and effective distillation. Concretely, we learn a set of bases (aka "memories") which are shared between classes and combined through learned flexible addressing functions to generate a diverse set of training examples. This leads to several benefits: 1) the size of the compressed data does not necessarily grow linearly with the number of classes; 2) an overall higher compression rate with more effective distillation is achieved; and 3) more generalized queries are allowed beyond recalling the original classes. We demonstrate state-of-the-art results on the dataset distillation task across five benchmarks, including up to 16.5% and 9.7% accuracy improvements when distilling CIFAR10 and CIFAR100, respectively. We then leverage our framework to perform continual learning, achieving state-of-the-art results on four benchmarks, with a 23.2% accuracy improvement on MANY.
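The core mechanism described in the abstract, shared bases combined by learned addressing coefficients to synthesize a distilled training set, can be sketched as follows. This is a minimal illustrative sketch, not the authors' released implementation; the class name `AddressableMemories`, the parameter shapes, and the linear-combination recall are assumptions made for exposition.

```python
import torch
import torch.nn as nn

class AddressableMemories(nn.Module):
    """Sketch of class-shared bases ("memories") plus per-class addressing
    coefficients that linearly combine them into synthetic training examples.
    Names and shapes are illustrative assumptions, not the paper's code."""

    def __init__(self, num_bases: int, feature_dim: int,
                 num_classes: int, examples_per_class: int):
        super().__init__()
        # Bases are shared across all classes, so storage need not grow
        # linearly with the number of classes.
        self.bases = nn.Parameter(torch.randn(num_bases, feature_dim) * 0.01)
        # Learned addressing coefficients: one set per (class, example) pair.
        self.addresses = nn.Parameter(
            torch.randn(num_classes, examples_per_class, num_bases) * 0.01)

    def forward(self):
        # Recall: each synthetic example is a linear combination of the bases.
        # examples: (num_classes, examples_per_class, feature_dim)
        examples = torch.einsum('cek,kd->ced', self.addresses, self.bases)
        num_classes, per_class, feat_dim = examples.shape
        labels = torch.arange(num_classes).repeat_interleave(per_class)
        return examples.reshape(-1, feat_dim), labels

# Example recall of a distilled set (hypothetical sizes for CIFAR10-like data).
memory = AddressableMemories(num_bases=64, feature_dim=3 * 32 * 32,
                             num_classes=10, examples_per_class=10)
images, labels = memory()  # (100, 3072), (100,)
# In the full method, the bases and addresses would be optimized with a
# dataset distillation objective so that a network re-trained on the
# recalled examples recovers performance on the original data.
```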
Author Information
Zhiwei Deng (Princeton University)
Olga Russakovsky (Princeton University)
More from the Same Authors
- 2022 Poster: Enabling Detailed Action Recognition Evaluation Through Video Dataset Augmentation »
  Jihoon Chung · Yu Wu · Olga Russakovsky
- 2021: Past and Future of data centric AI »
  Olga Russakovsky
- 2021: Live panel: The future of ImageNet »
  Matthias Bethge · Vittorio Ferrari · Olga Russakovsky
- 2021: Fairness and privacy aspects of ImageNet »
  Olga Russakovsky · Kaiyu Yang
- 2020 Poster: Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation »
  Zhiwei Deng · Karthik Narasimhan · Olga Russakovsky
- 2014 Workshop: Challenges in Machine Learning workshop (CiML 2014) »
  Isabelle Guyon · Evelyne Viegas · Percy Liang · Olga Russakovsky · Rinat Sergeev · Gábor Melis · Michele Sebag · Gustavo Stolovitzky · Jaume Bacardit · Michael S Kim · Ben Hamner
- 2006 Poster: Training Conditional Random Fields for Maximum Parse Accuracy »
  Samuel Gross · Olga Russakovsky · Chuong B Do · Serafim Batzoglou