ISCA Archive Interspeech 2014

Deep neural network bottleneck features for generalized variable parameter HMMs

Xurong Xie, Rongfeng Su, Xunying Liu, Lan Wang

Recently, deep neural networks (DNNs) have become increasingly popular for acoustic modelling in automatic speech recognition (ASR) systems. As the bottleneck features they produce are inherently discriminative and contain rich hidden factors that influence the surface acoustic realization, the standard approach is to augment the conventional acoustic features with the bottleneck features in a tandem framework. In this paper, an alternative approach to incorporating bottleneck features is investigated. The complex relationship between acoustic features and DNN bottleneck features is modelled using generalized variable parameter HMMs (GVP-HMMs). The optimal GVP-HMM structural configuration and model parameters are automatically learnt. Significant relative error rate reductions of 48% and 8% were obtained on Aurora 2 over the baseline multi-style HMM and tandem HMM systems, respectively.
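
The contrast between the two ways of using bottleneck features can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `tandem_frame` and `polynomial_mean` are hypothetical, and the assumption that a Gaussian mean varies as a polynomial of a single scalar bottleneck feature is a simplification made only for this example.

```python
# Illustrative sketch only; names and the scalar-polynomial form are assumptions.

def tandem_frame(acoustic, bottleneck):
    """Tandem approach: append the bottleneck features to the
    conventional acoustic features of the same frame."""
    return list(acoustic) + list(bottleneck)


def polynomial_mean(coeffs, v):
    """Variable-parameter idea: a model parameter (here one Gaussian mean)
    is a polynomial function of an auxiliary variable v, assumed here to be
    one bottleneck feature value.

    coeffs[p] is the coefficient of v**p; evaluated with Horner's rule.
    """
    result = 0.0
    for c in reversed(coeffs):
        result = result * v + c
    return result


if __name__ == "__main__":
    # Hypothetical values chosen only to exercise the functions.
    frame = tandem_frame([1.0, 2.0], [0.5])
    mean = polynomial_mean([1.0, 2.0, 3.0], 2.0)  # 1 + 2*v + 3*v**2 at v = 2
    print(frame, mean)
```

In the tandem case the model sees a longer feature vector, whereas in the variable-parameter case the bottleneck feature changes the model parameters used to score the original acoustic features.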


doi: 10.21437/Interspeech.2014-158

Cite as: Xie, X., Su, R., Liu, X., Wang, L. (2014) Deep neural network bottleneck features for generalized variable parameter HMMs. Proc. Interspeech 2014, 2739-2743, doi: 10.21437/Interspeech.2014-158

@inproceedings{xie14_interspeech,
  author={Xurong Xie and Rongfeng Su and Xunying Liu and Lan Wang},
  title={{Deep neural network bottleneck features for generalized variable parameter HMMs}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2739--2743},
  doi={10.21437/Interspeech.2014-158},
  issn={2308-457X}
}