ISCA Archive Interspeech 2012

HMM-based speech synthesis using sub-band basis spectrum model

Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine

In this paper, we propose an HMM-based text-to-speech (TTS) method using a sub-band basis spectrum model (SBM). SBM represents vocal tract spectra and phase characteristics as a linear combination of sub-band basis vectors. Some reports suggest that analysis-synthesized speech based on SBM is close to natural speech and that SBM performs effectively in text-to-speech. The SBM framework is therefore expected to improve the speech quality of HMM-based TTS. Subjective experimental results show that the proposed method improves speech quality in some conditions.

Index Terms: speech synthesis, hidden Markov model, sub-band basis spectrum model, phase feature