ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

New methods in continuous Mandarin speech recognition

C. Julian Chen, Ramesh A. Gopinath, Michael D. Monkowski, Michael A. Picheny, Katherine Shen

We describe new methods for speaker-independent, continuous mandarin speech recognition based on the IBM HMM-based continuous speech recognition system (1-3): First, we treat tones in mandarin as attributes of certain phonemes, instead of syllables. Second, instantaneous pitch is treated as a variable in the acoustic feature vector, in the same way as cepstra or energy. Third, by designing a set of word-segmentation rules to convert the continuous Chinese text into segmented text, an effective trigram language model is trained(4). By applying those new methods, a speaker-independent, very-large-vocabulary continuous mandarin dictation system is demonstrated. Decoding results showed that its performance is similar to the best results for US English.


doi: 10.21437/Eurospeech.1997-444

Cite as: Chen, C.J., Gopinath, R.A., Monkowski, M.D., Picheny, M.A., Shen, K. (1997) New methods in continuous Mandarin speech recognition. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1543-1546, doi: 10.21437/Eurospeech.1997-444

@inproceedings{chen97_eurospeech,
  author={C. Julian Chen and Ramesh A. Gopinath and Michael D. Monkowski and Michael A. Picheny and Katherine Shen},
  title={{New methods in continuous Mandarin speech recognition}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1543--1546},
  doi={10.21437/Eurospeech.1997-444},
  issn={1018-4074}
}