Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Authors

Eric Moulines, Francis Charpentier

Publication date

1990/12/1

Journal

Speech communication

Volume

Issue

5-6

Pages

453-467

Publisher

North-Holland

Description

We review, in a common framework, several recently proposed algorithms for improving the voice quality of text-to-speech synthesis based on the concatenation of acoustical units (Charpentier and Moulines, 1988; Moulines and Charpentier, 1988; Hamon et al., 1989). These algorithms rely on a pitch-synchronous overlap-add (PSOLA) approach for modifying the speech prosody and concatenating speech waveforms. The speech signal is modified either in the frequency domain (FD-PSOLA), using the Fast Fourier Transform, or directly in the time domain (TD-PSOLA), depending on the length of the window used in the synthesis process. The frequency-domain approach offers great flexibility in modifying the spectral characteristics of the speech signal, while the time-domain approach provides very efficient solutions for the real time implementation of synthesis …
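To make the time-domain idea concrete, the following is a minimal sketch of pitch modification by pitch-synchronous overlap-add. It is an illustration under stated assumptions, not the authors' implementation: the function name `td_psola`, its parameters, and the choice of a two-period Hann window are hypothetical, and the analysis pitch marks are assumed to be given (pitch-mark detection is not shown). Each output segment is copied from the nearest analysis mark and re-placed at a synthesis mark whose spacing is the local period divided by `pitch_scale`, so duration stays roughly constant while the pitch changes.

```python
import math


def td_psola(signal, marks, pitch_scale):
    """Simplified TD-PSOLA-style pitch shift (illustrative sketch only).

    signal      -- list of float samples
    marks       -- increasing sample indices of analysis pitch marks
                   (assumed to be supplied by the caller)
    pitch_scale -- > 1 raises the pitch, < 1 lowers it
    """
    n = len(signal)
    if len(marks) < 3 or pitch_scale <= 0:
        return list(signal)  # nothing sensible to do

    out = [0.0] * n
    inner = range(1, len(marks) - 1)  # marks that have a neighbour on each side
    t = float(marks[1])               # position of the first synthesis mark

    while t <= marks[-2]:
        # Pick the analysis mark closest to the current synthesis mark.
        k = min(inner, key=lambda i: abs(marks[i] - t))
        period = marks[k + 1] - marks[k]
        centre = int(round(t))

        # Overlap-add a Hann-windowed, two-period segment centred on mark k.
        for i in range(2 * period + 1):
            w = 0.5 - 0.5 * math.cos(math.pi * i / period)
            src = marks[k] - period + i
            dst = centre - period + i
            if 0 <= src < n and 0 <= dst < n:
                out[dst] += w * signal[src]

        # Synthesis marks are spaced by the scaled local period.
        t += period / pitch_scale

    return out
```

With `pitch_scale` equal to 1 and evenly spaced marks, the overlapping Hann windows sum to one, so the interior of the signal is reproduced unchanged. For other values this sketch applies no gain normalization, so the output level can differ from the input; a practical implementation would compensate for that.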

Total citations

Cited by 2057

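Citations per year:
1992: 6, 1993: 14, 1994: 11, 1995: 27, 1996: 31, 1997: 46, 1998: 59, 1999: 47,
2000: 58, 2001: 58, 2002: 64, 2003: 63, 2004: 68, 2005: 74, 2006: 77, 2007: 85,
2008: 61, 2009: 71, 2010: 78, 2011: 73, 2012: 94, 2013: 100, 2014: 91, 2015: 75,
2016: 74, 2017: 82, 2018: 65, 2019: 71, 2020: 60, 2021: 77, 2022: 59, 2023: 62,
2024: 35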
