ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Multi-channel speech enhancement using sparse coding on local time-frequency structures

Zhiyuan Zhou, Zhaogui Ding, Weifeng Li, Zhiyong Wu, Longbiao Wang, Qingmin Liao

A novel multi-channel speech enhancement technique is proposed in the present paper. We focus on the local sparsities of speech signals in contrast to the conventional beamforming and blind source seperation methods. The technique utilizes the difference of local structures in temporary-frequency domain between the target speech and interfering signals for enhancing the target speech. We first estimate the local structures of the speech and noise signals at each time-frequency bin to form a local dictionary, and then recover the clean speech via sparse coding. The proposed algorithm is simple to implement and requires no prior knowledge of speech and noise. Our experimental evaluations demonstrate that the proposed method can suppress interferer and meantime preserve target speech more than some conventional methods.


doi: 10.21437/Interspeech.2014-580

Cite as: Zhou, Z., Ding, Z., Li, W., Wu, Z., Wang, L., Liao, Q. (2014) Multi-channel speech enhancement using sparse coding on local time-frequency structures. Proc. Interspeech 2014, 2824-2827, doi: 10.21437/Interspeech.2014-580

@inproceedings{zhou14b_interspeech,
  author={Zhiyuan Zhou and Zhaogui Ding and Weifeng Li and Zhiyong Wu and Longbiao Wang and Qingmin Liao},
  title={{Multi-channel speech enhancement using sparse coding on local time-frequency structures}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2824--2827},
  doi={10.21437/Interspeech.2014-580},
  issn={2308-457X}
}