ISCA Archive SpeechProsody 2004
ISCA Archive SpeechProsody 2004

HMM-based speech synthesis with various speaking styles using model interpolation

Makoto Tachibana, Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi

This paper presents an approach to realizing various speaking styles and emotional expressions using a model interpolation technique in HMM-based speech synthesis. In the approach, we synthesize speech with an intermediate speaking style between representative speaking styles from a model obtained by interpolating representative style models. We chose three styles, "reading," "joyful," and "sad," as representative styles, and synthesized speech from models obtained by interpolating two models for every combination of two styles. From a result of a subjective similarity evaluation, it is shown that speech generated from an interpolated model has a speaking style in between two representative speaking styles.