ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems

Haihua Xu, Hang Su, Eng Siong Chng, Haizhou Li

In this paper, we investigate semi-supervised training (SST) method in various state-of-the-art acoustic modeling techniques, using bottle-neck and corresponding tandem features. These techniques include subspace GMM, tanh-neuron deep neural network (DNN), and a generalized soft-maxout (p-norm) DNN. We demonstrate that SST may lead up to 2% Word Error Rate (WER) reduction using all these techniques in each case, and the best one comes from tandem feature based p-norm DNN system. In addition to recognition performance, effectiveness of the SST on keyword search performance is also investigated. Results on Actual Term Weighted Value (ATWV) are reported, with an analysis on lattice density. It is shown that SST may not necessarily increase ATWV due to the shrink of lattices size.


doi: 10.21437/Interspeech.2014-472

Cite as: Xu, H., Su, H., Chng, E.S., Li, H. (2014) Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems. Proc. Interspeech 2014, 2078-2082, doi: 10.21437/Interspeech.2014-472

@inproceedings{xu14d_interspeech,
  author={Haihua Xu and Hang Su and Eng Siong Chng and Haizhou Li},
  title={{Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2078--2082},
  doi={10.21437/Interspeech.2014-472},
  issn={2308-457X}
}