ISCA Archive Interspeech 2014

A comparison of training approaches for discriminative segmental models

Hao Tang, Kevin Gimpel, Karen Livescu

Segmental models such as segmental conditional random fields have had some recent success in lattice rescoring for speech recognition. They provide a flexible framework for incorporating a wide range of features across different levels of units, such as phones and words. However, such models have mainly been trained by maximizing conditional likelihood, which may not be the best proxy for the task loss of speech recognition. In addition, there has been little work on designing cost functions as surrogates for the word error rate. In this paper, we investigate various losses and introduce a new cost function for training segmental models. We compare lattice rescoring results for multiple tasks and also study the impact of several choices required when optimizing these losses.


doi: 10.21437/Interspeech.2014-307

Cite as: Tang, H., Gimpel, K., Livescu, K. (2014) A comparison of training approaches for discriminative segmental models. Proc. Interspeech 2014, 1219-1223, doi: 10.21437/Interspeech.2014-307

@inproceedings{tang14_interspeech,
  author={Hao Tang and Kevin Gimpel and Karen Livescu},
  title={{A comparison of training approaches for discriminative segmental models}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={1219--1223},
  doi={10.21437/Interspeech.2014-307},
  issn={2308-457X}
}