ISCA Archive Eurospeech 1997

Speech recognition using HMM-state confusion characteristics

Yumi Wakita, Harald Singer, Yoshinori Sagisaka

In our previous work, we proposed a re-entry modeling of phonemes that are lost during the search process. In that re-entry modeling, the recognition results are postprocessed, and the originally recognized phoneme sequences are converted to new phoneme sequences using HMM-state confusion characteristics spanning several phonemes. We confirmed that HMM-state confusions are effective for re-entry modeling. In this paper, we propose re-entry modeling during recognition, using a multiple-pronunciation dictionary in which pronunciations are added based on HMM-state confusion characteristics. The pronunciations are added taking into account the part-of-speech (POS) dependency of the confusion characteristics. In continuous recognition experiments, we confirmed that two points are effective for improving word recognition rates: (1) expressing confusions as HMM-state sequences, and (2) adding pronunciations in consideration of the part-of-speech dependency of the confusion characteristics.
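The dictionary expansion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The state labels (such as `o.1`), the confusion rules, the POS tags, and the function names `apply_rule` and `expand_lexicon` are all hypothetical, chosen only to show how POS-dependent confusions over HMM-state sequences might add alternative pronunciations to a lexicon.

```python
from typing import Dict, List, Tuple

StateSeq = Tuple[str, ...]

# Hypothetical confusion rules, keyed by part of speech.
# Each rule maps a canonical HMM-state pattern to the state
# sequence it is assumed to be confused with.
CONFUSIONS: Dict[str, List[Tuple[StateSeq, StateSeq]]] = {
    "particle": [(("o.1", "o.2", "o.3"), ("o.1", "o.3"))],
    "noun": [(("k.3", "a.1"), ("g.3", "a.1"))],
}

# Hypothetical lexicon: (word, POS) -> canonical HMM-state sequence.
LEXICON: Dict[Tuple[str, str], StateSeq] = {
    ("wo", "particle"): ("o.1", "o.2", "o.3"),
    ("ka", "noun"): ("k.1", "k.2", "k.3", "a.1", "a.2", "a.3"),
    ("hai", "interjection"): ("h.1", "a.1", "i.1"),
}


def apply_rule(seq: StateSeq, pattern: StateSeq,
               replacement: StateSeq) -> List[StateSeq]:
    """Return one variant per position where `pattern` occurs in `seq`."""
    variants = []
    n = len(pattern)
    for i in range(len(seq) - n + 1):
        if seq[i:i + n] == pattern:
            variants.append(seq[:i] + replacement + seq[i + n:])
    return variants


def expand_lexicon(
    lexicon: Dict[Tuple[str, str], StateSeq],
    confusions: Dict[str, List[Tuple[StateSeq, StateSeq]]],
) -> Dict[Tuple[str, str], List[StateSeq]]:
    """Add confusion-derived pronunciations, using only the rules
    registered for each entry's part of speech."""
    expanded = {}
    for (word, pos), base in lexicon.items():
        prons = [base]  # the canonical pronunciation always comes first
        for pattern, replacement in confusions.get(pos, []):
            for variant in apply_rule(base, pattern, replacement):
                if variant not in prons:
                    prons.append(variant)
        expanded[(word, pos)] = prons
    return expanded


if __name__ == "__main__":
    for entry, prons in expand_lexicon(LEXICON, CONFUSIONS).items():
        print(entry, prons)
```

In this sketch, an entry whose part of speech has no registered rules keeps only its canonical pronunciation, which is one simple way to make the added pronunciations POS-dependent.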


doi: 10.21437/Eurospeech.1997-8

Cite as: Wakita, Y., Singer, H., Sagisaka, Y. (1997) Speech recognition using HMM-state confusion characteristics. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 7-10, doi: 10.21437/Eurospeech.1997-8

@inproceedings{wakita97_eurospeech,
  author={Yumi Wakita and Harald Singer and Yoshinori Sagisaka},
  title={{Speech recognition using HMM-state confusion characteristics}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={7--10},
  doi={10.21437/Eurospeech.1997-8},
  issn={1018-4074}
}