ISCA Archive Interspeech 2012

Sparse banded precision matrices for low resource speech recognition

Weibin Zhang, Pascale Fung

We propose to use sparse banded precision matrices for speech recognition when training data is insufficient. Previously, we proposed a method that drives the precision matrices toward a sparse structure during training under the HMM framework. The recognition accuracy of the resulting compact model was shown to be better than that of full covariance or diagonal covariance systems. In this paper, we modify the penalization to automatically learn sparse banded precision matrices, which makes the trained models even more compact. We demonstrate that the order of the features is important to the success of the proposed method. Using our proposed feature order, we can substantially reduce the right half-bandwidth of the sparse banded matrices without sacrificing recognition accuracy, which saves memory and computation.
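
To illustrate why a smaller half-bandwidth saves memory and computation, the following is a minimal sketch in Python with NumPy. It is not the training method of the paper: it only counts the free entries of a symmetric banded matrix and evaluates the Gaussian quadratic form using the band alone. The function names, the tridiagonal example matrix, and the feature dimension of 39 are assumptions made for illustration.

```python
import numpy as np

def band_mask(d, half_bandwidth):
    """Boolean d x d mask, True where |i - j| <= half_bandwidth."""
    idx = np.arange(d)
    return np.abs(idx[:, None] - idx[None, :]) <= half_bandwidth

def banded_param_count(d, half_bandwidth):
    """Free entries of a symmetric banded d x d matrix:
    the diagonal plus the superdiagonals inside the band."""
    k = min(half_bandwidth, d - 1)
    return d + sum(d - m for m in range(1, k + 1))

def banded_quadratic_form(diff, precision, half_bandwidth):
    """Compute diff^T P diff touching only entries inside the band.
    Assumes P is symmetric and zero outside the band."""
    d = diff.shape[0]
    total = np.sum(np.diag(precision) * diff * diff)
    for m in range(1, min(half_bandwidth, d - 1) + 1):
        # m-th superdiagonal; symmetry contributes the factor of 2
        total += 2.0 * np.sum(np.diag(precision, k=m) * diff[:-m] * diff[m:])
    return total

def tridiagonal_precision(d):
    """Hypothetical example: a positive definite precision matrix
    with half-bandwidth 1 (2 on the diagonal, -1 next to it)."""
    p = np.zeros((d, d))
    np.fill_diagonal(p, 2.0)
    idx = np.arange(d - 1)
    p[idx, idx + 1] = -1.0
    p[idx + 1, idx] = -1.0
    return p

if __name__ == "__main__":
    d = 39  # assumed feature dimension, for illustration only
    for k in (0, 1, d - 1):
        print(k, banded_param_count(d, k))
```

Under these assumptions, a 39-dimensional Gaussian needs 39 precision entries when diagonal (half-bandwidth 0), 77 with half-bandwidth 1, and 780 when full. The cost of the quadratic form in `banded_quadratic_form` grows with the number of stored band entries rather than with the square of the dimension.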

Index Terms: low resource speech recognition, sparse precision matrix, sparse banded precision matrix