ISCA Archive ICSLP 2002
ISCA Archive ICSLP 2002

Using EM-trained string-edit distances for approximate matching of acoustic morphemes

Michael Levit, Elmar Nöth, Allen Gorin

Our research concerns spoken language understanding within the domain of automated telecommunication services. In the recent papers we presented a new methodology for training of statistical language models for recognition and understanding of utterances from large corpora of phone sequences obtained as the output of a taskindependent ASR-system. The advantage of this strategy compared to the traditional word-based strategy is that we don’t have to manually transcribe large amounts of data in order to extract acoustic morphemes to train the classifier. Since the baseline strategy suffered high False Rejection Rates caused by finding no acoustic morphemes in the test data, we describe in this paper how approximate matching can be incorporated in the Bayes-classifier to reduce FRR. The experiments are evaluated for "How May I Help You?"-task.