ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

A novel training approach for improving speech recognition under adverse stressful conditions

Sahar E. Bou-Ghazale, John H. L. Hansen

This paper presents a new training approach for improving recognition of speech under emotional and environmental stress. The proposed approach consists of training a speech recognizer with synthetically generated speech under each stress condition using stress perturbation models previously formulated in [4, 1]. The perturbation models were previously formulated to statistically model the parameter variations under angry, loud, and Lombard effect and were employed in an analysis-synthesis scheme for generating stressed synthetic speech from isolated neutral speech. In this paper, two training approaches employing the synthetically generated stressed speech are presented consisting of : speaker-independent, and speaker-adaptive training methods. Both approaches outperform neutral trained recognizers when tested with angry, loud, and Lombard effect speech.


doi: 10.21437/Eurospeech.1997-627

Cite as: Bou-Ghazale, S.E., Hansen, J.H.L. (1997) A novel training approach for improving speech recognition under adverse stressful conditions. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2387-2390, doi: 10.21437/Eurospeech.1997-627

@inproceedings{boughazale97_eurospeech,
  author={Sahar E. Bou-Ghazale and John H. L. Hansen},
  title={{A novel training approach for improving speech recognition under adverse stressful conditions}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2387--2390},
  doi={10.21437/Eurospeech.1997-627},
  issn={1018-4074}
}