View article

[PDF] from ru.nl

Source normalization for language-independent speaker recognition using i-vectors

Authors

ML McLaren, Miranti Indar Mandasari, David A van Leeuwen

Publication date

2012

Publisher

Singapore:[Sn]

Description

Source-normalization (SN) is an effective means of improving the robustness of i-vector-based speaker recognition for under-resourced and unseen cross-speech-source evaluation conditions. The technique of source-normalization estimates directions of undesired within-speaker variation more accurately than traditional methods when cross-source variation is not explicitly observed from each speaker in system development data. Source normalization can be incorporated into Within Class Covariance Normalization (WCCN) as an effective preprocessing step to Probabilistic Linear Discriminant Analysis (PLDA) based speaker recognition with i-vectors. This paper proposes to extend the application of sourcenormalization to the reduction of language-dependence in PLDA speaker recognition by normalising for the variation that separates languages. Evaluated on the NIST 2008 and 2010 speaker recognition evaluation (SRE) data sets, the proposed Language Normalized WCCN (LN-WCCN) provides relative improvements of 26% in minimum DCF and 14% in EER under multilingual scenarios without detriment to common Englishonly conditions. LN-WCCN is also shown to significantly improve calibration performance when calibration parameters are learned from scores mismatched to evaluation conditions.

Total citations

Cited by 33

201320142015201620172018201920202021202220233 2 5 6 6 5 2 2 1 1

Scholar articles

Source normalization for language-independent speaker recognition using i-vectors

ML McLaren, MI Mandasari, DA van Leeuwen - 2012