ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Enhancing audio source separability using spectro-temporal regularization with NMF

Colin Vaz, Dimitrios Dimitriadis, Shrikanth S. Narayanan

We propose a spectro-temporal regularization approach for NMF that accounts for a source's spectral variability over time. The regularization terms allow NMF to adapt the spectral basis matrices optimally to reduce mismatch between the spectral characteristics of sources observed during training and encountered during separation. We first tested our algorithm on a simulated source separation task. Preliminary results show significant improvement of SAR, SDR, and SIR values over some current NMF methods. We also tested our algorithm on a speech enhancement task and were able to show a modest improvement of the PESQ scores of the recovered speech.


doi: 10.21437/Interspeech.2014-216

Cite as: Vaz, C., Dimitriadis, D., Narayanan, S.S. (2014) Enhancing audio source separability using spectro-temporal regularization with NMF. Proc. Interspeech 2014, 855-859, doi: 10.21437/Interspeech.2014-216

@inproceedings{vaz14_interspeech,
  author={Colin Vaz and Dimitrios Dimitriadis and Shrikanth S. Narayanan},
  title={{Enhancing audio source separability using spectro-temporal regularization with NMF}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={855--859},
  doi={10.21437/Interspeech.2014-216},
  issn={2308-457X}
}