ISCA Archive Interspeech 2014

Speech pre-enhancement using a discriminative microscopic intelligibility model

Maryam Al Dabel, Jon Barker

We propose a new approach for optimally pre-enhancing speech signals for given noise conditions. Like others, we optimise the predicted intelligibility of the signal; however, we employ a statistical 'microscopic' intelligibility model that encodes information about which spectro-temporal speech regions are most informative. Uniquely, our optimisation strategy aims to maximise the discrimination between the correct interpretation and competing incorrect interpretations of the utterance. We present results from studies that use speech-shaped stationary noise maskers and show that the new strategy leads to solutions that are more varied than the simple high-frequency emphasis employed in many pre-enhancement systems.


doi: 10.21437/Interspeech.2014-470

Cite as: Dabel, M.A., Barker, J. (2014) Speech pre-enhancement using a discriminative microscopic intelligibility model. Proc. Interspeech 2014, 2068-2072, doi: 10.21437/Interspeech.2014-470

@inproceedings{dabel14_interspeech,
  author={Maryam Al Dabel and Jon Barker},
  title={{Speech pre-enhancement using a discriminative microscopic intelligibility model}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2068--2072},
  doi={10.21437/Interspeech.2014-470},
  issn={2308-457X}
}