ISCA Archive Interspeech 2014

Objective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues

Fei Chen, Yi Hu

While temporal envelope and fine-structure cues are known to be good predictors of speech intelligibility, it is not clear how well they correlate with subjective quality ratings, particularly for noise-suppressed speech. The present work evaluated two objective measures (i.e., NCM and TFSS), which were originally developed as speech intelligibility indices based primarily on envelope or fine-structure cues, when applied to predicting the subjective quality ratings of noise-suppressed speech along three dimensions: signal distortion, noise distortion, and overall quality. We considered a wide range of distortions introduced by four types of real-world noise at two signal-to-noise ratio levels and by four classes of noise-suppression algorithms. The results show that the present envelope- and fine-structure-based measures are poor predictors of the subjective quality ratings of noise-suppressed speech. The PESQ measure is so far the best choice for objectively predicting both the subjective quality ratings and the intelligibility scores of noise-suppressed speech.


doi: 10.21437/Interspeech.2014-467

Cite as: Chen, F., Hu, Y. (2014) Objective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues. Proc. Interspeech 2014, 2055-2058, doi: 10.21437/Interspeech.2014-467

@inproceedings{chen14j_interspeech,
  author={Fei Chen and Yi Hu},
  title={{Objective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2055--2058},
  doi={10.21437/Interspeech.2014-467},
  issn={2308-457X}
}