ISCA Archive ICSLP 2002

A data-driven approach to source-formant type text-to-speech system

Hiroki Mori, Takahiro Ohtsuka, Hideki Kasuya

A data-driven formant-type TTS system is proposed. The formant-type speech synthesizer is one of the most promising architectures for flexible control of various voice qualities. By applying the ARX-based speech analysis method, source and formant parameters are obtained automatically. It is shown that a TTS system can be built from these parameters without requiring any heuristic rules to control vocal tract characteristics.
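
As an illustration of what a formant parameter means in a synthesizer of this kind, the following is a minimal sketch of a single second-order formant resonator. It is not the ARX analysis method of the paper. The function names, the 16 kHz sampling rate, and the example formant values are assumptions chosen for illustration. The sketch maps a formant frequency and bandwidth to the coefficients of an all-pole filter, recovers them again, and filters a source signal through the resonator.

```python
import math


def resonator_coeffs(freq_hz, bw_hz, fs_hz):
    """Return (a1, a2) of the denominator 1 + a1*z^-1 + a2*z^-2 for one formant.

    Hypothetical helper: a pole pair at radius r and angle theta
    corresponds to a formant with the given frequency and bandwidth.
    """
    r = math.exp(-math.pi * bw_hz / fs_hz)
    theta = 2.0 * math.pi * freq_hz / fs_hz
    return -2.0 * r * math.cos(theta), r * r


def formant_from_coeffs(a1, a2, fs_hz):
    """Invert resonator_coeffs: recover (frequency, bandwidth) in Hz.

    Assumes the filter has a complex-conjugate pole pair (a1*a1 < 4*a2).
    """
    r = math.sqrt(a2)
    theta = math.acos(-a1 / (2.0 * r))
    freq_hz = theta * fs_hz / (2.0 * math.pi)
    bw_hz = -math.log(r) * fs_hz / math.pi
    return freq_hz, bw_hz


def synthesize(source, a1, a2):
    """Filter a source signal: y[n] = x[n] - a1*y[n-1] - a2*y[n-2]."""
    y1 = y2 = 0.0
    out = []
    for x in source:
        y = x - a1 * y1 - a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


if __name__ == "__main__":
    fs = 16000.0  # assumed sampling rate, not taken from the paper
    a1, a2 = resonator_coeffs(500.0, 80.0, fs)  # illustrative formant values
    print(formant_from_coeffs(a1, a2, fs))
    # An impulse stands in for the voice source in this sketch.
    print(synthesize([1.0] + [0.0] * 7, a1, a2))
```

A full formant synthesizer would combine several such resonators and drive them with a glottal source waveform rather than an impulse; this sketch covers only the relation between one formant's parameters and its filter.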