ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Unit selection based speech synthesis for poor channel condition

Ling Cen, Minghui Dong, Paul Chan, Haizhou Li

Synthesized speech can be largely degraded in noise, resulting in compromised speech quality. In this paper, we propose a unit selection based speech synthesis system for better speech quality under poor channel conditions. First, the measurement of speech intelligibility is incorporated in the cost function as a searching criterion for unit selection. Next, the prosody of the selected units is modified according to the Lombard effect. Prosody modification includes increasing the amplitude of unvoiced phoneme and enlarging the speech duration. Finally, the FIR equalization via convex optimization is applied to reduce signal distortion due to the channel effect. Listening test in our experiments shows that the quality level of synthetic speech can be improved under poor channel conditions with the help of our proposed synthesis system.

doi: 10.21437/Interspeech.2009-595

Cite as: Cen, L., Dong, M., Chan, P., Li, H. (2009) Unit selection based speech synthesis for poor channel condition. Proc. Interspeech 2009, 2075-2078, doi: 10.21437/Interspeech.2009-595

  author={Ling Cen and Minghui Dong and Paul Chan and Haizhou Li},
  title={{Unit selection based speech synthesis for poor channel condition}},
  booktitle={Proc. Interspeech 2009},