Authors
Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King
Publication date
2011/5/22
Conference
ICASSP
Pages
5112-5115
Publisher
IEEE
Description
In this paper we evaluate four objective measures of speech with regard to predicting the intelligibility of synthesized speech in diverse noisy situations. We evaluated three intelligibility measures (the Dau measure, the glimpse proportion and the Speech Intelligibility Index (SII)) and one quality measure, the Perceptual Evaluation of Speech Quality (PESQ). To generate the synthesized speech we used a state-of-the-art HMM-based speech synthesis system. The noisy conditions comprised four additive noises. The measures were compared with subjective intelligibility scores obtained in listening tests. The results show the Dau and the glimpse measures to be the best predictors of intelligibility, with correlations of around 0.83 with subjective scores. All measures predicted intelligibility less accurately for synthetic speech than has previously been found for natural speech, the SII measure in particular. In …
Total citations
[Chart: citations per year, 2011-2023]
Scholar articles
C Valentini-Botinhao, J Yamagishi, S King - 2011 IEEE International Conference on Acoustics …, 2011