Timezone: »
Poster
Differentiable Optimization of Generalized Nondecomposable Functions using Linear Programs
Zihang Meng · Lopamudra Mukherjee · Yichao Wu · Vikas Singh · Sathya Ravi
We propose a framework which makes it feasible to directly train deep neural networks with respect to popular families of task-specific non-decomposable performance measures such as AUC, multi-class AUC, $F$-measure and others. A common feature of the optimization model that emerges from these tasks is that it involves solving a Linear Programs (LP) during training where representations learned by upstream layers characterize the constraints or the feasible set. The constraint matrix is not only large but the constraints are also modified at each iteration. We show how adopting a set of ingenious ideas proposed by Mangasarian for 1-norm SVMs -- which advocates for solving LPs with a generalized Newton method -- provides a simple and effective solution that can be run on the GPU. In particular, this strategy needs little unrolling, which makes it more efficient during backward pass. Further, even when the constraint matrix is too large to fit on the GPU memory (say large minibatch settings), we show that running the Newton method in a lower dimensional space yields accurate gradients for training, by utilizing a statistical concept called {\em sufficient} dimension reduction. While a number of specialized algorithms have been proposed for the models that we describe here, our module turns out to be applicable without any specific adjustments or relaxations. We describe each use case, study its properties and demonstrate the efficacy of the approach over alternatives which use surrogate lower bounds and often, specialized optimization schemes. Frequently, we achieve superior computational behavior and performance improvements on common datasets used in the literature.
Author Information
Zihang Meng (University of Wisconsin, Madison)
Lopamudra Mukherjee (University of Wisconsin Whitewater)
Yichao Wu (University of Illinois, Chicago)
Vikas Singh (UW-Madison)
Sathya Ravi (University of Illinois at Chicago)
More from the Same Authors
-
2021 Poster: An Online Riemannian PCA for Stochastic Canonical Correlation Analysis »
Zihang Meng · Rudrasis Chakraborty · Vikas Singh -
2018 Poster: A Statistical Recurrent Model on the Manifold of Symmetric Positive Definite Matrices »
Rudrasis Chakraborty · Chun-Hao Yang · Xingjian Zhen · Monami Banerjee · Derek Archer · David Vaillancourt · Vikas Singh · Baba C Vemuri -
2016 Poster: Hypothesis Testing in Unsupervised Domain Adaptation with Applications in Alzheimer's Disease »
Hao Zhou · Vamsi Ithapu · Sathya Narayanan Ravi · Vikas Singh · Grace Wahba · Sterling C Johnson -
2014 Poster: Permutation Diffusion Maps (PDM) with Application to the Image Association Problem in Computer Vision »
Deepti Pachauri · Risi Kondor · Gautam Sargur · Vikas Singh -
2013 Poster: Speeding up Permutation Testing in Neuroimaging »
Chris Hinrichs · Vamsi Ithapu · Qinyuan Sun · Sterling C Johnson · Vikas Singh -
2013 Spotlight: Speeding up Permutation Testing in Neuroimaging »
Chris Hinrichs · Vamsi Ithapu · Qinyuan Sun · Sterling C Johnson · Vikas Singh -
2013 Poster: Solving the multi-way matching problem by permutation synchronization »
Deepti Pachauri · Risi Kondor · Vikas Singh -
2012 Poster: Wavelet based multi-scale shape features on arbitrary surfaces for cortical thickness discrimination »
Won Hwa Kim · Deepti Pachauri · Charles R Hatt · Moo. K Chung · Sterling C Johnson · Vikas Singh -
2012 Poster: Q-MKL: Matrix-induced Regularization in Multi-Kernel Learning with Applications to Neuroimaging »
Chris Hinrichs · Vikas Singh · Jiming Peng · Sterling C Johnson -
2010 Spotlight: Epitome driven 3-D Diffusion Tensor image segmentation: on extracting specific structures »
Kamiya Motwani · Nagesh Adluru · Chris Hinrichs · Vikas Singh -
2010 Poster: Epitome driven 3-D Diffusion Tensor image segmentation: on extracting specific structures »
Kamiya Motwani · Nagesh Adluru · Chris Hinrichs · andrew L Alexander · Vikas Singh -
2007 Spotlight: Ensemble Clustering using Semidefinite Programming »
Vikas Singh · Lopamudra Mukherjee · Jiming Peng · Jinhui Xu -
2007 Poster: Ensemble Clustering using Semidefinite Programming »
Vikas Singh · Lopamudra Mukherjee · Jiming Peng · Jinhui Xu