ISCA Archive Interspeech 2011

Detection of shouted speech in the presence of ambient noise

Jouni Pohjalainen, Tuomo Raitio, Paavo Alku

This study focuses on the detection of shouted speech in realistic noisy conditions. An automatic system based on modified mel frequency cepstral coefficient (MFCC) feature extraction and Gaussian mixture model (GMM) classification is developed. The performance of the automatic system is compared against human perception measured by a listening test. At moderate noise levels, the automatic system outperforms humans. In severe conditions, classification by humans is clearly better.
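The classification step can be illustrated with a minimal sketch, assuming each class is modeled by a diagonal-covariance GMM and an utterance is assigned to the class whose model gives the highest total log-likelihood over its feature frames. This is not the authors' implementation: the class labels, the two-dimensional stand-in features, and all model parameters below are hypothetical, and the modified MFCC front end is not reproduced.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log p(x) under a GMM given as a list of (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def classify(frames, models):
    """Return the label whose GMM gives the highest summed log-likelihood."""
    scores = {
        label: sum(gmm_log_likelihood(f, gmm) for f in frames)
        for label, gmm in models.items()
    }
    return max(scores, key=scores.get)


# Hypothetical hand-set models over 2-D stand-in features.
# Real MFCC vectors have more dimensions, and real models are trained on data.
MODELS = {
    "shout": [(0.5, [4.0, 1.0], [1.0, 1.0]), (0.5, [5.0, 2.0], [1.0, 1.0])],
    "normal": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, -1.0], [1.0, 1.0])],
}

label = classify([[4.2, 1.1], [4.8, 1.9]], MODELS)
```

In practice the mixture weights, means, and variances would be estimated from labeled training data (typically with the EM algorithm) rather than set by hand, and frames would be summed in the log domain as shown here so that long utterances do not underflow.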