ISCA Archive Eurospeech 1995
ISCA Archive Eurospeech 1995

A binaural selectivity model for speech recognition

Markus Bodden, Timothy R. Anderson

Neural networks that employed unsupervised learning were used on the output of a binaural auditory model, know as the Cocktail-Party-Processor, to perform context-independent phoneme recognition. Experiments which compared the performance of the binaural model representation to that of a monaural version showed that the binaural model performed significantly better in terms of phoneme recognition accuracy under the conditions tested (low signal-to-noise ratios (SNR) and a small database of speakers). The binaural model representations' performance has approximately a 20 dB SNR advantage over the monaural representation.