ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Using hidden Markov models for speech enhancement

Akihiro Kato, Ben Milner

This work presents an approach to speech enhancement that operates using a speech production model to reconstruct a clean speech signal from a set of speech parameters that are estimated from the noisy speech. The motivation is to remove the distortion and residual and musical noises that are associated with conventional filtering-based methods of speech enhancement. The STRAIGHT vocoder forms the model for speech reconstruction and requires a time-frequency surface and fundamental frequency information. Hidden Markov model synthesis is used to create an estimate of the time-frequency surface and this is combined with the noisy surface using a perceptually motivated signal-to-noise ratio weighting. Experimental results compare the proposed reconstruction-based method to conventional filtering-based approaches of speech enhancement.


doi: 10.21437/Interspeech.2014-576

Cite as: Kato, A., Milner, B. (2014) Using hidden Markov models for speech enhancement. Proc. Interspeech 2014, 2695-2699, doi: 10.21437/Interspeech.2014-576

@inproceedings{kato14_interspeech,
  author={Akihiro Kato and Ben Milner},
  title={{Using hidden Markov models for speech enhancement}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2695--2699},
  doi={10.21437/Interspeech.2014-576},
  issn={2308-457X}
}