ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

Candidate generation for ASR output error correction using a context-dependent syllable cluster-based confusion matrix

Chao-Hong Liu, Chung-Hsien Wu, David Sarwono, Jhing-Fa Wang

Error correction techniques have been proposed in the applications of language learning and spoken dialogue systems for spoken language understanding. These techniques include two consecutive stages: the generation of correction candidates and the selection of correction candidates. In this study, a Context-Dependent Syllable Cluster (CD-SC)-based Confusion Matrix is proposed for the generation of correction candidates. A Contextual Fitness Score, measuring the sequential relationship to the neighbors of the candidate, is proposed for corrected syllable sequence selection. Finally, the n-gram language model is used to determine the final word sequence output. Experiments show that the proposed method improved from 0.742 to 0.771 in terms of BLEU score as compared to the conventional speech recognition mechanism.