Timezone: »
We consider the problem of aggregating models learned from sequestered, possibly heterogeneous datasets. Exploiting tools from Bayesian nonparametrics, we develop a general meta-modeling framework that learns shared global latent structures by identifying correspondences among local model parameterizations. Our proposed framework is model-independent and is applicable to a wide range of model types. After verifying our approach on simulated data, we demonstrate its utility in aggregating Gaussian topic models, hierarchical Dirichlet process based hidden Markov models, and sparse Gaussian processes with applications spanning text summarization, motion capture analysis, and temperature forecasting.
Author Information
Mikhail Yurochkin (IBM Research, MIT-IBM Watson AI Lab)
Mayank Agarwal (IBM Research AI, MIT-IBM Watson AI Lab)
Soumya Ghosh (IBM Research)
Kristjan Greenewald (IBM Research)
Nghia Hoang (IBM Research)
More from the Same Authors
-
2021 : Measuring the sensitivity of Gaussian processes to kernel choice »
Will Stephenson · Soumya Ghosh · Tin Nguyen · Mikhail Yurochkin · Sameer Deshpande · Tamara Broderick -
2021 : COVID-19 India Dataset: Parsing Detailed COVID-19 Data in Daily Health Bulletins from States in India »
Mayank Agarwal · Tathagata Chakraborti · Sachin Grover -
2022 : Are you using test log-likelihood correctly? »
Sameer Deshpande · Soumya Ghosh · Tin Nguyen · Tamara Broderick -
2023 Poster: Identifiability Guarantees for Causal Disentanglement from Soft Interventions »
Jiaqi Zhang · Kristjan Greenewald · Chandler Squires · Akash Srivastava · Karthikeyan Shanmugam · Caroline Uhler -
2023 Poster: Post-processing Private Synthetic Data for Improving Utility on Selected Measures »
Hao Wang · Shivchander Sudalairaj · John Henning · Kristjan Greenewald · Akash Srivastava -
2023 Poster: Max-Sliced Mutual Information »
Dor Tsur · Ziv Goldfeld · Kristjan Greenewald -
2022 Poster: $k$-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension »
Ziv Goldfeld · Kristjan Greenewald · Theshani Nuradha · Galen Reeves -
2022 Poster: Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting »
Prasanna Sattigeri · Soumya Ghosh · Inkit Padhi · Pierre Dognin · Kush Varshney -
2021 Poster: Does enforcing fairness mitigate biases caused by subpopulation shift? »
Subha Maity · Debarghya Mukherjee · Mikhail Yurochkin · Yuekai Sun -
2021 Poster: Post-processing for Individual Fairness »
Felix Petersen · Debarghya Mukherjee · Yuekai Sun · Mikhail Yurochkin -
2021 Poster: On sensitivity of meta-learning to support data »
Mayank Agarwal · Mikhail Yurochkin · Yuekai Sun -
2020 : NLC2CMD Competition Organizers: Metrics, Data, Tracks »
Mayank Agarwal -
2020 : NLC2CMD Competition Keynote: Tellina »
Victoria Lin · Mayank Agarwal · Tathagata Chakraborti -
2020 : NLC2CMD Competition Organizers: Introduction, Problem Description, CLAI »
Mayank Agarwal -
2020 : Spotlight Session 1 »
Augustus Odena · Maxwell Nye · Disha Shrivastava · Mayank Agarwal · Vincent J Hellendoorn · Charles Sutton -
2020 Poster: Asymptotic Guarantees for Generative Modeling Based on the Smooth Wasserstein Distance »
Ziv Goldfeld · Kristjan Greenewald · Kengo Kato -
2020 Poster: Active Structure Learning of Causal DAGs via Directed Clique Trees »
Chandler Squires · Sara Magliacane · Kristjan Greenewald · Dmitriy Katz · Murat Kocaoglu · Karthikeyan Shanmugam -
2020 Poster: Continuous Regularized Wasserstein Barycenters »
Lingxiao Li · Aude Genevay · Mikhail Yurochkin · Justin Solomon -
2020 Poster: Approximate Cross-Validation for Structured Models »
Soumya Ghosh · Will Stephenson · Tin Nguyen · Sameer Deshpande · Tamara Broderick -
2020 Poster: Entropic Causal Inference: Identifiability and Finite Sample Results »
Spencer Compton · Murat Kocaoglu · Kristjan Greenewald · Dmitriy Katz -
2020 Demonstration: IBM Federated Learning Community Edition: An Interactive Demonstration »
Laura Wynter · Chaitanya Kumar · Pengqian Yu · Mikhail Yurochkin · Amogh Tarcar -
2019 Poster: Alleviating Label Switching with Optimal Transport »
Pierre Monteiller · Sebastian Claici · Edward Chien · Farzaneh Mirzazadeh · Justin Solomon · Mikhail Yurochkin -
2019 Poster: Hierarchical Optimal Transport for Document Representation »
Mikhail Yurochkin · Sebastian Claici · Edward Chien · Farzaneh Mirzazadeh · Justin Solomon -
2019 Demonstration: Project BB: Bringing AI to the Command Line »
Tathagata Chakraborti · Mayank Agarwal -
2019 Poster: Scalable inference of topic evolution via models for latent geometric structures »
Mikhail Yurochkin · Zhiwei Fan · Aritra Guha · Paraschos Koutris · XuanLong Nguyen -
2019 Poster: Sample Efficient Active Learning of Causal Trees »
Kristjan Greenewald · Dmitriy Katz · Karthikeyan Shanmugam · Sara Magliacane · Murat Kocaoglu · Enric Boix-Adsera · Guy Bresler -
2012 Poster: From Deformations to Parts: Motion-based Segmentation of 3D Objects »
Soumya Ghosh · Erik Sudderth · Matthew Loper · Michael J Black -
2011 Poster: Spatial distance dependent Chinese Restaurant Process for image segmentation »
Soumya Ghosh · Andrei B Ungureanu · Erik Sudderth · David Blei