ISCA Archive Interspeech 2005

Discriminative maximum entropy language model for speech recognition

Chuang-Hua Chueh, To-Chang Chien, Jen-Tzung Chien

This paper presents a new discriminative language model based on the whole-sentence maximum entropy (ME) framework. The proposed discriminative ME (DME) model uses an integrated linguistic and acoustic model that incorporates features from the n-gram model together with the acoustic log likelihoods of the target and competing models. The DME language model for speech recognition is estimated through constrained optimization of this integrated model. We also show the relation between DME estimation and maximum mutual information (MMI) estimation for language modeling: using the sentence-level log likelihood ratios of competing and target sentences as the acoustic features for ME language modeling is equivalent to performing MMI discriminative language modeling. In speech recognition experiments, the DME model achieved a lower word error rate than the conventional ME model.
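As a rough illustration of the whole-sentence ME form mentioned in the abstract, the sketch below scores each candidate sentence as an exponentiated weighted sum of sentence-level features and normalizes over a finite candidate list. It is not the authors' implementation: the function names, the bigram-count features, the use of one scalar acoustic feature per sentence, and normalization over a small candidate list (rather than over all sentences) are assumptions made only for this example, and the weights are set by hand rather than estimated.

```python
import math
from collections import Counter


def bigram_counts(sentence):
    """Count bigrams in a sentence, with start/end markers (illustrative n-gram features)."""
    words = ["<s>"] + sentence.split() + ["</s>"]
    return Counter(zip(words, words[1:]))


def sentence_features(sentence, acoustic_feature):
    """Build a sentence-level feature map: bigram counts plus one acoustic feature.

    The single scalar acoustic feature is an assumption for this sketch.
    """
    feats = {("bigram", bg): float(c) for bg, c in bigram_counts(sentence).items()}
    feats[("acoustic",)] = acoustic_feature
    return feats


def me_distribution(candidates, weights):
    """Return p(W) proportional to exp(sum_i lambda_i * f_i(W)) over the candidate list.

    candidates: list of (sentence, acoustic_feature) pairs.
    weights: dict mapping feature keys to lambda values; missing keys count as 0.
    """
    scores = []
    for sentence, acoustic_feature in candidates:
        feats = sentence_features(sentence, acoustic_feature)
        scores.append(sum(weights.get(k, 0.0) * v for k, v in feats.items()))
    # Subtract the maximum score before exponentiating for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


if __name__ == "__main__":
    # Hypothetical candidates with made-up acoustic scores.
    candidates = [("recognize speech", -10.0), ("wreck a nice beach", -12.0)]
    weights = {("acoustic",): 1.0, ("bigram", ("recognize", "speech")): 0.5}
    for (sentence, _), p in zip(candidates, me_distribution(candidates, weights)):
        print(f"{sentence}: {p:.3f}")
```

In a discriminative setting, the weights would be estimated so that the target sentence is favored over its competitors; the hand-set values here only show how the linguistic and acoustic features enter the same exponential model.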


doi: 10.21437/Interspeech.2005-10

Cite as: Chueh, C.-H., Chien, T.-C., Chien, J.-T. (2005) Discriminative maximum entropy language model for speech recognition. Proc. Interspeech 2005, 721-724, doi: 10.21437/Interspeech.2005-10

@inproceedings{chueh05_interspeech,
  author={Chuang-Hua Chueh and To-Chang Chien and Jen-Tzung Chien},
  title={{Discriminative maximum entropy language model for speech recognition}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={721--724},
  doi={10.21437/Interspeech.2005-10},
  issn={2958-1796}
}