ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

A continuous prominence score based on acoustic features

Jean-Philippe Goldman, Mathieu Avanzi, Antoine Auchlin, Anne Catherine Simon

Up to now, prominence detection has mainly been considered a binary matter, a syllable being considered as prominent or not. This contribution aims at developing an automatic detection procedure of gradual prominence. Based on 4 prosodic parameters (relative duration, relative F0, F0 movement and pause duration), the system provides each syllable with a gradual score of prominence ranging from 0 (non-prominent syllable) to 4 (extra prominent syllable). The automatic detection (ProsoProm) relies on a manually annotated corpus (18 minutes, or 3669 syllables, of speech annotated by three experts) and is cumulative (the relative weight of each parameter is taken into account in order to compute a global score for each syllable). Discussion of the results includes a comparison to a similar software (Analor) and a qualitative analysis of misses and false detections. The agreement between automatic and (median) human annotation reaches a Kappa score of 0.8.

Index Terms: prosody, speech, prominence detection, cumulative prominence, automatic prosodic analysis, expert vs automatic prosodic annotation