ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition

Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin

This paper presents an approach to feature enhancement for noisy speech recognition. Three prior models are introduced to characterize clean speech, noise and noisy speech respectively using sequential noise estimation based on noise-normalized stochastic vector mapping. Environment adaptation is also adopted to reduce the mismatch between training data and test data. For AURORA2 database, the experimental results indicate that a 0.77% digit accuracy improvement for multi-condition training and 0.29% digit accuracy improvement for clean speech training were achieved without stereo training data compared to the SPLICE-based approach with recursive noise estimation. For MAT-BN Mandarin broadcast news database, a (2).6% syllable accuracy improvement for anchor speech and 4.2% syllable accuracy improvement for field report speech were obtained compared to the MCE-based approach.


doi: 10.21437/Interspeech.2006-8

Cite as: Hsieh, C.-H., Wu, C.-H., Lin, J.-Y. (2006) Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition. Proc. Interspeech 2006, paper 1170-Mon1A2O.2, doi: 10.21437/Interspeech.2006-8

@inproceedings{hsieh06_interspeech,
  author={Chia-Hsin Hsieh and Chung-Hsien Wu and Jun-Yu Lin},
  title={{Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1170-Mon1A2O.2},
  doi={10.21437/Interspeech.2006-8},
  issn={2958-1796}
}