ISCA Archive Eurospeech 1999

Front-end improvements to reduce stationary & variable channel and noise distortions in continuous speech recognition tasks

Xavier Menéndez-Pidal, Ruxin Chen, Duanpei Wu, Mick Tanaka

This paper presents our current work on front-end techniques for robust speech recognition under mismatched conditions (additive noise mismatch and channel mismatch). Two algorithms are combined to compensate for the distortions caused by differing channel characteristics and additive noise: 1) a Cepstral Mean Normalization and Variance Scaling technique (MNVS) and 2) an Adaptive Gaussian Attenuation algorithm (AGA). With both techniques combined, the channel distortion effects were reduced to 90% on the HTIMIT task and the additive noise effects were reduced to 80% on the TIMIT task corrupted with additive car noise.
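
As a rough illustration of the idea behind mean normalization and variance scaling, the sketch below applies generic per-utterance normalization to cepstral feature vectors. It is an assumption-laden example, not the MNVS algorithm of the paper, whose details are not given in this abstract. The function name `mean_variance_normalize` and the `eps` parameter are hypothetical. The underlying observation is that a stationary linear channel appears as an approximately constant additive offset in the cepstral domain, so subtracting the per-utterance mean removes it.

```python
import math


def mean_variance_normalize(frames, eps=1e-8):
    """Generic per-utterance cepstral mean and variance normalization.

    frames: list of frames, each a list of cepstral coefficients of equal length.
    eps: small constant (an assumption of this sketch) to avoid division by zero.
    Returns a new list of frames in which every coefficient dimension has
    zero mean and approximately unit variance over the utterance.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    # Per-dimension mean over all frames of the utterance.
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Per-dimension (population) standard deviation over the utterance.
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dim)
    ]
    return [
        [(f[d] - means[d]) / (stds[d] + eps) for d in range(dim)]
        for f in frames
    ]


# Toy utterance: three frames of two cepstral coefficients each.
clean = [[1.0, 2.0], [3.0, 6.0], [5.0, 4.0]]
# The same utterance seen through a hypothetical stationary channel,
# modeled here as a constant offset per coefficient.
channel_offset = [0.5, -1.0]
shifted = [[c + o for c, o in zip(f, channel_offset)] for f in clean]

norm_clean = mean_variance_normalize(clean)
norm_shifted = mean_variance_normalize(shifted)
```

In this toy case `norm_clean` and `norm_shifted` agree up to floating-point rounding, which shows why a constant channel offset does not survive the normalization. Additive noise is not a constant cepstral offset, so this step alone does not address it.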