ISCA Archive PMLA 2002
ISCA Archive PMLA 2002

On the road to improved lexical confusability metrics

Eric Fosler-Lussier, Ingunn Amdal, Hong-Kwang Jeff Kuo

Pronunciation modeling in automatic speech recognition systems has had mixed results in the past; one likely reason for poor performance is the increased confusability in the lexicon from adding new pronunciation variants. In this work, we propose a new framework for determining lexically confusable words based on inverted finite state transducers (FSTs); we also present experiments designed to test some of the implementation details of this framework. The method is evaluated by looking at how well the algorithm predicts the errors in an ASR system. We see from the confusions learned in a training set that we are able to generalize this information to predict errors in an unseen test set.