ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Speech recognition by denoising and dereverberation based on spectral subtraction in a real noisy reverberant environment

Kyohei Odani, Longbiao Wang, Atsuhiko Kai

A blind dereverberation method based on spectral subtraction (SS) using a multi-channel least mean squares (MCLMS) algorithm was previously proposed. The results of a large vocabulary continuous speech recognition (LVCSR) task showed that this method achieved significant improvements over the conventional method based on cepstral mean normalization (CMN) and beamforming in a simulated reverberant environment without additive noise. In this paper, we evaluate the blind dereverberation method in a real noisy reverberant environment. We present a denoising and dereverberation method based on power SS or generalized SS (GSS), and evaluate our proposed method using speech in a real environment.The GSS-based method achieves an average relative word error reduction rate of 39.1% and 11.5% compared to the conventional CMN and power-SS based methods, respectively.

Index Terms: hands-free speech recognition, blind dere- verberation, noise reduction, spectral subtraction, real en- vironment