ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Automatic detection of high vocal effort in telephone speech

Jouni Pohjalainen, Tuomo Raitio, Hannu Pulakka, Paavo Alku

A system is proposed for the automatic detection of high vocal effort in speech. The system is evaluated using both PCM-coded speech and AMRcoded telephone speech. In addition, the effect of far-end noise in the telephone conditions is studied using both matched-condition training and cases with additive noise mismatch. The proposed system is based on Bayesian classification of mel-frequency cepstral feature vectors. Concerning the MFCC feature extraction process, the substitution of a spectrum analysis method emphasizing the fine structure improves the results in the noisy cases.

Index Terms: vocal effort detection, speech analysis


doi: 10.21437/Interspeech.2012-217

Cite as: Pohjalainen, J., Raitio, T., Pulakka, H., Alku, P. (2012) Automatic detection of high vocal effort in telephone speech. Proc. Interspeech 2012, 691-694, doi: 10.21437/Interspeech.2012-217

@inproceedings{pohjalainen12b_interspeech,
  author={Jouni Pohjalainen and Tuomo Raitio and Hannu Pulakka and Paavo Alku},
  title={{Automatic detection of high vocal effort in telephone speech}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={691--694},
  doi={10.21437/Interspeech.2012-217},
  issn={2958-1796}
}