ISCA Archive ICSLP 2002
ISCA Archive ICSLP 2002

Double the trouble: handling noise and reverberation in far-field automatic speech recognition

David Gelbart, Nelson Morgan

Far-field microphone speech signals cause high error rates for automatic speech recognition systems, due to room reverberation and lower signal-to-noise ratios. We have observed large increases in speech recognition word error rates when using a far-field (3-6 feet)microphone in a conference room, in comparison with recordings from close-talking microphones. In an earlier paper, we showed improvements in far-field speech recognition performance using a long-term log spectral subtraction method to combat reverberation. This method is based on a principle similar to cepstral mean subtraction but uses a much longer analysis window (e.g., 1 s) in order to deal with reverberation. Here we show that a combination of short-term noise filtering and long-term log spectral subtraction can further reduce recognition word error rates.