Authors
Eric Moulines, Jean Laroche
Publication date
1995/2/1
Source
Speech communication
Volume
16
Issue
2
Pages
175-205
Publisher
North-Holland
Description
Time-scale and, to a lesser extent, pitch-scale modifications of speech and audio signals are the subject of major theoretical and practical interest. Applications are numerous, including, to name but a few, text-to-speech synthesis (based on acoustical unit concatenation), transformation of voice characteristics, foreign language learning but also audio monitoring or film/soundtrack post-synchronization. To fulfill the need for high-quality time and pitch-scaling, a number of algorithms have been proposed recently, along with their real-time implementation, sometimes for very inexpensive hardware. It appears that most of these algorithms can be viewed as slight variations of a small number of basic schemes. This contribution reviews frequency-domain algorithms (phase-vocoder) and time-domain algorithms (Time-Domain Pitch-Synchronous Overlap/Add and the like) in the same framework. More recent variations of …
Total citations
[Chart: citations per year, 1995 to 2024]