ISCA Archive Interspeech 2012

Binary mask estimation for improved speech intelligibility in reverberant environments

Oldooz Hazrati, Jaewook Lee, Philipos C. Loizou

A blind (non-ideal) time-frequency (T-F) masking technique is proposed for suppressing reverberation. A binary mask is estimated at each T-F unit by extracting a single variance-based feature from the reverberant signal and comparing its value against an adaptive threshold. The performance of the estimated binary mask is evaluated using intelligibility listening tests with hearing-impaired listeners in four moderately to highly reverberant conditions. Results indicated that the proposed T-F masking technique yielded significant improvements in intelligibility, even in highly reverberant conditions (T60 = 1.0 s). This improvement was attributed to the recovery of the vowel/consonant boundaries, which are severely smeared in reverberation.
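
The masking step described above can be illustrated with a minimal sketch. The abstract does not define the variance-based feature or the rule for adapting the threshold, so everything specific here is an assumption made for illustration: the feature is taken to be the variance of the magnitude over a trailing window of frames in each band, the threshold is taken to be the per-band mean of that feature scaled by a factor, and the names `local_variance`, `estimate_binary_mask`, `win`, and `alpha` are hypothetical. This is not the authors' implementation.

```python
import numpy as np


def local_variance(mag, win=5):
    """Assumed feature: variance of the magnitude over a trailing
    window of `win` frames, computed separately in each band.

    mag: array of shape (n_bands, n_frames) with T-F magnitudes.
    """
    n_bands, n_frames = mag.shape
    feat = np.zeros((n_bands, n_frames), dtype=float)
    for t in range(n_frames):
        start = max(0, t - win + 1)  # shorter window at the signal start
        feat[:, t] = mag[:, start:t + 1].var(axis=1)
    return feat


def estimate_binary_mask(mag, win=5, alpha=1.0):
    """Return a 0/1 mask with the same shape as `mag`.

    Assumed threshold: per-band mean of the feature times `alpha`,
    so the threshold adapts to the level of each band.
    """
    feat = local_variance(mag, win)
    thresh = alpha * feat.mean(axis=1, keepdims=True)
    # Keep a T-F unit (1) only where the feature exceeds the threshold.
    return (feat > thresh).astype(int)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    mag = np.abs(rng.normal(size=(4, 50)))  # stand-in for a reverberant spectrogram
    mask = estimate_binary_mask(mag)
    masked = mag * mask  # retained units pass through, the rest are zeroed
    print(mask.shape, int(mask.sum()))
```

Under these assumptions, units where the magnitude changes quickly (for example at onsets) receive a 1, and steady units receive a 0. Whether this matches the behaviour of the paper's feature cannot be determined from the abstract alone.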

Index Terms: Binary mask, cochlear implant (CI), dereverberation