Authors
Ahilan Kanagasundaram, Robert Vogt, David Dean, Sridha Sridharan
Publication date
2012
Journal
Proceedings of The Speaker and Language Recognition Workshop: Odyssey 2012
Pages
28-33
Publisher
International Speech Communication Association
Description
This paper investigates the effects of limited speech data in the context of speaker verification using a probabilistic linear discriminant analysis (PLDA) approach. Being able to reduce the length of required speech data is important to the development of automatic speaker verification system in real world applications. When sufficient speech is available, previous research has shown that heavy-tailed PLDA (HTPLDA) modeling of speakers in the i-vector space provides state-of-the-art performance, however, the robustness of HTPLDA to the limited speech resources in development, enrolment and verification is an important issue that has not yet been investigated. In this paper, we analyze the speaker verification performance with regards to the duration of utterances used for both speaker evaluation (enrolment and verification) and score normalization and PLDA modeling during development. Two different approaches to total-variability representation are analyzed within the PLDA approach to show improved performance in short-utterance mismatched evaluation conditions and conditions for which insufficient speech resources are available for adequate system development. The results presented within this paper using the NIST 2008 Speaker Recognition Evaluation dataset suggest that the HTPLDA system can continue to achieve better performance than Gaussian PLDA (GPLDA) as evaluation utterance lengths are decreased. We also highlight the importance of matching durations for score normalization and PLDA modeling to the expected evaluation conditions. Finally, we found that a pooled total-variability approach to PLDA modeling …
Total citations
201120122013201420152016201720182019202020212022202320241711242530403226372020124
Scholar articles
A Kanagasundaram, R Vogt, D Dean, S Sridharan - Proceedings of The Speaker and Language …, 2012
A Kanagasundaram, R Vogt, D Dean, S Sridharan - The Speaker and Language Recognition Workshop …, 2012