ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Detecting laughter and filled pauses using syllable-based features

Gouzhen An, David Guy Brizan, Andrew Rosenberg

Identifying laughter and filled pauses is important to understanding spontaneous human speech. These are two common vocal expressions that are non-lexical and incredibly communicative. In this paper, we use a two-tiered system for identifying laughter and filled pauses. We first generate frame level hypotheses and subsequently rescore these based on features derived from acoustic syllable segmentation. Using Interspeech 2013 ComParE challenge corpus, SVC, we find that these rescoring experiments and inclusion of syllable based acoustic/prosodic features allow for the detection of laughter and filled pauses by at 89.3% UAAUC on the development set, an improvement of 1.7% over the challenge baseline.

doi: 10.21437/Interspeech.2013-62

Cite as: An, G., Brizan, D.G., Rosenberg, A. (2013) Detecting laughter and filled pauses using syllable-based features. Proc. Interspeech 2013, 178-181, doi: 10.21437/Interspeech.2013-62

  author={Gouzhen An and David Guy Brizan and Andrew Rosenberg},
  title={{Detecting laughter and filled pauses using syllable-based features}},
  booktitle={Proc. Interspeech 2013},