View article

[PDF] from epfl.ch

A spectrogram model for enhanced source localization and noise-robust ASR

Authors

Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot

Publication date

2005

Conference

Proceedings of INTERSPEECH 2005

Issue

EPFL-CONF-83299

Description

This paper proposes a simple, computationally efficient 2-mixture model approach to discrimination between speech and background noise. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algorithm. A first application to sector-based, joint audio source localization and detection, using multiple microphones, confirms that the model can provide major enhancement. A second application to the single channel speech recognition task in a noisy environment yields major improvement on stationary noise and promising results on non-stationary noise.

Total citations

Cited by 4

200520061 3

Scholar articles

A spectrogram model for enhanced source localization and noise-robust ASR

G Lathoud, B Mesot - Proceedings of Interspeech 2005, 2005