ISCA Archive Interspeech 2006

A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies

Babak Nasersharif, Ahmad Akbari

Mel-frequency cepstral coefficients (MFCCs) are the most widely used and successful features for speech recognition, but their performance degrades in the presence of additive noise. In this paper, we propose a noise compensation method for Mel filter bank energies, and hence for MFCC features. The method has two steps: Mel sub-band spectral subtraction, followed by compression of the Mel sub-band energies. For the compression step, we propose a sub-band SNR-dependent compression function, which replaces the logarithm used in conventional MFCC feature extraction when additive noise is present. Experimental results show that the proposed method significantly improves the performance of MFCC features in noisy conditions: at an SNR of 0 dB, it reduces the word error rate by about 70% for different types of additive noise.
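
The two-step pipeline described in the abstract can be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: the abstract does not give the compression function, so `snr_dependent_compress` below is a hypothetical placeholder that blends `log(E)` at high sub-band SNR with `log(1 + E)` at low SNR. The function names, the flooring factor, the SNR thresholds, and the assumption that a per-band noise estimate is already available are all assumptions made for this sketch.

```python
import numpy as np

EPS = 1e-10  # small constant to avoid log(0); an assumption of this sketch


def mel_subband_spectral_subtraction(mel_energies, noise_estimate, floor=0.01):
    """Step 1: subtract a per-band noise estimate from the Mel filter bank
    energies, flooring the result at a small fraction of the noisy energy."""
    cleaned = mel_energies - noise_estimate
    return np.maximum(cleaned, floor * mel_energies)


def subband_snr_db(enhanced, noise_estimate):
    """Estimate the SNR (in dB) of each Mel sub-band."""
    return 10.0 * np.log10((enhanced + EPS) / (noise_estimate + EPS))


def snr_dependent_compress(enhanced, snr_db, low_db=0.0, high_db=20.0):
    """Step 2 (HYPOTHETICAL placeholder, not the paper's function):
    use log(E) in high-SNR bands and log(1 + E) in low-SNR bands,
    interpolating linearly in between."""
    w = np.clip((snr_db - low_db) / (high_db - low_db), 0.0, 1.0)
    return w * np.log(enhanced + EPS) + (1.0 - w) * np.log1p(enhanced)


def dct_ii(x, n_ceps):
    """Unnormalized DCT-II over the last axis, keeping the first n_ceps terms."""
    n = x.shape[-1]
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (m + 0.5) / n)  # shape (n_ceps, n)
    return x @ basis.T


def robust_mfcc(mel_energies, noise_estimate, n_ceps=13):
    """Enhance the Mel energies, compress them, then take the DCT."""
    enhanced = mel_subband_spectral_subtraction(mel_energies, noise_estimate)
    snr = subband_snr_db(enhanced, noise_estimate)
    compressed = snr_dependent_compress(enhanced, snr)
    return dct_ii(compressed, n_ceps)
```

Here `mel_energies` is assumed to be an array of shape `(frames, bands)` and `noise_estimate` an array of shape `(bands,)`, for example averaged over leading non-speech frames. The only structural change from conventional MFCC extraction is that the fixed logarithm is replaced by a function that also takes the sub-band SNR as input.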


doi: 10.21437/Interspeech.2006-9

Cite as: Nasersharif, B., Akbari, A. (2006) A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies. Proc. Interspeech 2006, paper 1632-Mon1A2O.3, doi: 10.21437/Interspeech.2006-9

@inproceedings{nasersharif06_interspeech,
  author={Babak Nasersharif and Ahmad Akbari},
  title={{A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1632-Mon1A2O.3},
  doi={10.21437/Interspeech.2006-9},
  issn={2958-1796}
}