ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Using blob detection in missing feature linear-frequency cepstral coefficients for robust sound event recognition

Yi Ren Leng, Huy Dat Tran

The Missing Feature Linear-Frequency Cepstral Coefficients (MF-LFCC) is a noise robust cepstral feature that transforms both clean and noisy signals into a similar representation. Unlike conventional Missing Feature Techniques, the MF-LFCC does not require spectrogram imputation or classifier modification. To improve the noise mask used in the MF-LFCC, we propose to use the computer vision technique of blob detection to identify the peaks characterizing the sparsity of sound event spectrograms. For single sound event recognition using SVM classifiers, the MF-LFCC is shown to significantly outperform the MFCC baseline and the noise robust ESTI Advanced Front End feature in noisy conditions.

Index Terms: blob detection, missing feature, robust recognition, sound event recognition