Poster
A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds
Paris Smaragdis · Madhusudana Shashanka · Bhiksha Raj
In this paper we present an algorithm for separating mixed sounds from a monophonic recording. Our approach makes use of training data, which allows us to learn representations of the types of sounds that compose the mixture. In contrast to popular methods that attempt to extract compact generalizable models for each sound from training data, we employ the training data itself as a representation of the sources in the mixture. We show that mixtures of known sounds can be described as sparse combinations of the training data itself, and in doing so produce significantly better separation results than similar systems based on compact statistical models.
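The idea of describing a mixture as a sparse combination of training frames can be sketched in a few lines. The following is a minimal illustration only, not the authors' algorithm: it assumes non-negative magnitude spectrograms, uses the training frames directly as a fixed dictionary, and swaps in a standard KL-divergence multiplicative update with an L1 penalty to encourage sparsity. The function name `separate_with_exemplars` and its parameters `n_iter` and `sparsity` are hypothetical.

```python
import numpy as np

def separate_with_exemplars(mix, train_a, train_b, n_iter=200, sparsity=0.0, eps=1e-12):
    """Split a mixture spectrogram using training frames as a fixed dictionary.

    mix:     (F, T) non-negative magnitude spectrogram of the mixture
    train_a: (F, Na) training frames for source A
    train_b: (F, Nb) training frames for source B
    Returns two (F, T) estimates, one per source.
    """
    # The dictionary is the training data itself: one column per training frame,
    # normalized so that every column sums to 1.
    W = np.concatenate([train_a, train_b], axis=1)
    W = W / (W.sum(axis=0, keepdims=True) + eps)
    n_a = train_a.shape[1]

    # Non-negative weights, one row per training frame, one column per mixture frame.
    rng = np.random.default_rng(0)
    H = rng.random((W.shape[1], mix.shape[1])) + eps

    for _ in range(n_iter):
        # Multiplicative update for the KL divergence with W held fixed.
        # The `sparsity` term is an L1 penalty (an assumption of this sketch).
        ratio = mix / (W @ H + eps)
        H *= (W.T @ ratio) / (W.sum(axis=0)[:, None] + sparsity + eps)

    # Per-source reconstructions from each source's share of the dictionary.
    est_a = W[:, :n_a] @ H[:n_a]
    est_b = W[:, n_a:] @ H[n_a:]

    # Soft masks distribute the observed mixture between the two sources.
    total = est_a + est_b + eps
    return mix * est_a / total, mix * est_b / total
```

In practice the masked magnitudes would be combined with the mixture phase and inverted back to the time domain; that step is omitted here.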
Author Information
Paris Smaragdis (University of Illinois Urbana-Champaign)
Madhusudana Shashanka (Mars Information Services)
Bhiksha Raj (Carnegie Mellon University)
More from the Same Authors
- 2022 Poster: USB: A Unified Semi-supervised Learning Benchmark for Classification
  Yidong Wang · Hao Chen · Yue Fan · Wang SUN · Ran Tao · Wenxin Hou · Renjie Wang · Linyi Yang · Zhi Zhou · Lan-Zhe Guo · Heli Qi · Zhen Wu · Yu-Feng Li · Satoshi Nakamura · Wei Ye · Marios Savvides · Bhiksha Raj · Takahiro Shinozaki · Bernt Schiele · Jindong Wang · Xing Xie · Yue Zhang
- 2021 : HEAR 2021: Holistic Evaluation of Audio Representations + Q&A
  Joseph Turian · Jordan Shier · Bhiksha Raj · Bjoern Schuller · Christian Steinmetz · George Tzanetakis · Gissel Velarde · Kirk McNally · Max Henry · Nicolas Pinto · Yonatan Bisk · George Tzanetakis · Camille Noufi · Dorien Herremans · Jesse Engel · Justin Salamon · Prany Manocha · Philippe Esling · Shinji Watanabe
- 2020 Poster: Is normalization indispensable for training deep neural network?
  Jie Shao · Kai Hu · Changhu Wang · Xiangyang Xue · Bhiksha Raj
- 2020 Oral: Is normalization indispensable for training deep neural network?
  Jie Shao · Kai Hu · Changhu Wang · Xiangyang Xue · Bhiksha Raj
- 2019 Poster: Face Reconstruction from Voice using Generative Adversarial Networks
  Yandong Wen · Bhiksha Raj · Rita Singh
- 2017 : Poster Session Music and environmental sounds
  Oriol Nieto · Jordi Pons · Bhiksha Raj · Tycho Tax · Benjamin Elizalde · Juhan Nam · Anurag Kumar
- 2017 : Poster Session Speech: source separation, enhancement, recognition, synthesis
  Shuayb Zarar · Rasool Fakoor · SRI HARSHA DUMPALA · Minje Kim · Paris Smaragdis · Mohit Dubey · Jong Hwan Ko · Sakriani Sakti · Yuxuan Wang · Lijiang Guo · Garrett T Kenyon · Andros Tjandra · Tycho Tax · Younggun Lee
- 2017 : Adaptive Front-ends for End-to-end Source Separation
  Shrikant Venkataramani · Paris Smaragdis
- 2014 Poster: Spectral Learning of Mixture of Hidden Markov Models
  Cem Subakan · Johannes Traa · Paris Smaragdis
- 2012 Poster: Unsupervised Structure Discovery for Semantic Analysis of Audio
  Sourish Chaudhuri · Bhiksha Raj
- 2010 Poster: Multiparty Differential Privacy via Aggregation of Locally Trained Classifiers
  Manas A Pathak · Shantanu Rane · Bhiksha Raj
- 2007 Poster: Sparse Overcomplete Latent Variable Decomposition of Counts Data
  Madhusudana Shashanka · Bhiksha Raj · Paris Smaragdis