ISCA Archive SpeechProsody 2012

Prosody modeling

Jianhua Tao

Prosody is a suprasegmental feature of speech and represents various characteristics of speakers or utterances. It is well accepted that speech prosody can be modeled hierarchically; however, it remains an open question how the hierarchical model is influenced by acoustic and contextual features. Which features are important for labeling or predicting the hierarchical structure of prosody? Among them, pitch accent and phrase play important roles. In this talk, we will give a broad view of recent research on the hierarchical prosody model, focusing mainly on the features of phrase and pitch accent from acoustic and perceptual perspectives at different hierarchical levels. The influence of syntactic structure will also be discussed. Finally, we will introduce a prediction model for the hierarchical prosody structure and show how to use it in a text-to-speech system to obtain more expressive synthesis results.