Authors
Eric Moulines, Francis Charpentier
Publication date
1990/12/1
Journal
Speech Communication
Volume
9
Issue
5-6
Pages
453-467
Publisher
North-Holland
Description
We review, in a common framework, several recently proposed algorithms for improving the voice quality of text-to-speech synthesis based on the concatenation of acoustical units (Charpentier and Moulines, 1988; Moulines and Charpentier, 1988; Hamon et al., 1989). These algorithms rely on a pitch-synchronous overlap-add (PSOLA) approach for modifying the speech prosody and concatenating speech waveforms. The modifications of the speech signal are performed either in the frequency domain (FD-PSOLA), using the Fast Fourier Transform, or directly in the time domain (TD-PSOLA), depending on the length of the window used in the synthesis process. The frequency-domain approach offers great flexibility in modifying the spectral characteristics of the speech signal, while the time-domain approach provides very efficient solutions for the real-time implementation of synthesis …
Total citations
[Chart: citations per year, 1992-2024]