Authors
Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot
Publication date
2005
Conference
Proceedings of INTERSPEECH 2005
Issue
EPFL-CONF-83299
Description
This paper proposes a simple, computationally efficient 2-mixture model approach to discrimination between speech and background noise. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algorithm. A first application to sector-based, joint audio source localization and detection, using multiple microphones, confirms that the model can provide major enhancement. A second application to the single-channel speech recognition task in a noisy environment yields major improvement on stationary noise and promising results on non-stationary noise.