Timezone: »

Latent Gaussian Activity Propagation: Using Smoothness and Structure to Separate and Localize Sounds in Large Noisy Environments
Daniel Johnson · Daniel Gorelik · Ross E Mawhorter · Kyle Suver · Weiqing Gu · Steven Xing · Cody Gabriel · Peter Sankhagowit

Tue Dec 04 02:00 PM -- 04:00 PM (PST) @ Room 517 AB #112

We present an approach for simultaneously separating and localizing multiple sound sources using recorded microphone data. Inspired by topic models, our approach is based on a probabilistic model of inter-microphone phase differences, and poses separation and localization as a Bayesian inference problem. We assume sound activity is locally smooth across time, frequency, and location, and use the known position of the microphones to obtain a consistent separation. We compare the performance of our method against existing algorithms on simulated anechoic voice data and find that it obtains high performance across a variety of input conditions.

Author Information

Daniel Johnson (Harvey Mudd College)
Daniel Gorelik (Harvey Mudd College)
Ross E Mawhorter (Harvey Mudd College)
Kyle Suver (Harvey Mudd College)
Weiqing Gu (Harvey Mudd College)
Steven Xing (Intel Corporation)
Cody Gabriel (Intel Corporation)
Peter Sankhagowit (Intel Corporation)