View article

Source-normalized LDA for robust speaker recognition using i-vectors from multiple speech sources

Authors

Mitchell McLaren, David Van Leeuwen

Publication date

2011/8/18

Journal

IEEE Transactions on Audio, Speech, and Language Processing

Volume

Issue

Pages

755-766

Publisher

IEEE

Description

The recent development of the i-vector framework for speaker recognition has set a new performance standard in the research field. An i-vector is a compact representation of a speakers utterance extracted from a total variability subspace. Prior to classification using a cosine kernel, i-vectors are projected into an linear discriminant analysis (LDA) space in order to reduce inter-session variability and enhance speaker discrimination. The accurate estimation of this LDA space from a training dataset is crucial to detection performance. A typical training dataset, however, does not consist of utterances acquired through all sources of interest for each speaker. This has the effect of introducing systematic variation related to the speech source in the between-speaker covariance matrix and results in an incomplete representation of the within-speaker scatter matrix used for LDA. The recently proposed source-normalized …

Total citations

Cited by 99

20122013201420152016201720182019202020212022202320248 5 13 15 11 13 10 6 7 5 2 1 2

Scholar articles

Source-normalized LDA for robust speaker recognition using i-vectors from multiple speech sources

M McLaren, D Van Leeuwen - IEEE Transactions on Audio, Speech, and Language …, 2011