ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Probabilistic linear discriminant analysis with bottleneck features for speech recognition

Liang Lu, Steve Renals

We have recently proposed a new acoustic model based on probabilistic linear discriminant analysis (PLDA) which enjoys the flexibility of using higher dimensional acoustic features, and is more capable to capture the intra-frame feature correlations. In this paper, we investigate the use of bottleneck features obtained from a deep neural network (DNN) for the PLDA-based acoustic model. Experiments were performed on the Switchboard dataset — a large vocabulary conversational telephone speech corpus. We observe significant word error reduction by using the bottleneck features. In addition, we have also compared the PLDA-based acoustic model to three others using Gaussian mixture models (GMMs), subspace GMMs and hybrid deep neural networks (DNNs), and PLDA can achieve comparable or slightly higher recognition accuracy from our experiments.


doi: 10.21437/Interspeech.2014-227

Cite as: Lu, L., Renals, S. (2014) Probabilistic linear discriminant analysis with bottleneck features for speech recognition. Proc. Interspeech 2014, 910-914, doi: 10.21437/Interspeech.2014-227

@inproceedings{lu14b_interspeech,
  author={Liang Lu and Steve Renals},
  title={{Probabilistic linear discriminant analysis with bottleneck features for speech recognition}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={910--914},
  doi={10.21437/Interspeech.2014-227},
  issn={2958-1796}
}