ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

More on the normalization of syllable prominence ratings

Christopher Sappok, Denis Arnold

The perception of syllable prominence depends to a limited extent on the acoustic properties of the speech signal in question. Psychoacoustic factors are involved as well. Thus, research often relies on two types of data: subjective prominence ratings collected in perception experiments and acoustic measures. A problem with the rating data is noise resulting from individual approaches to the rating task. This paper addresses the question of how this noise can be reduced by normalization, evaluating 12 normalization methods. In a perception experiment, prominence ratings concerning German read speech were collected. From the raw rating data 12 different emirrorf data-sets were computed according to the 12 methods. Each mirror data-set was correlated with the same set of underlying acoustic data. The multiple regression setup included raw syllable duration and within-syllable maximum F0 and intensity. Adjusted R2-values could be raised considerably with selected methods.

Index Terms: perception experiment, inter-rater variability, intra-rater variability, read speech, German, prose, poetry