Authors
Yannis Stylianou, Olivier Cappé, Eric Moulines
Publication date
1998/3
Journal
IEEE Transactions on speech and audio processing
Volume
6
Issue
2
Pages
131-142
Publisher
IEEE
Description
Voice conversion, as considered in this paper, is defined as modifying the speech signal of one speaker (source speaker) so that it sounds as if it had been pronounced by a different speaker (target speaker). Our contribution includes the design of a new methodology for representing the relationship between two sets of spectral envelopes. The proposed method is based on the use of a Gaussian mixture model of the source speaker spectral envelopes. The conversion itself is represented by a continuous parametric function which takes into account the probabilistic classification provided by the mixture model. The parameters of the conversion function are estimated by least squares optimization on the training data. This conversion method is implemented in the context of the HNM (harmonic+noise model) system, which allows high-quality modifications of speech signals. Compared to earlier methods based on …
Total citations
20002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320244811222233376056798861688510582827872927172343611
Scholar articles
Y Stylianou, O Cappé, E Moulines - IEEE Transactions on speech and audio processing, 1998