ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Dealing with pronunciation variants at the language model level for the continuous automatic speech recognition of French

Guy Pérennou, L. Pousse

In this paper, we describe three approaches of continuous speech recognition. Two of them (referred to as (W,P) and (W',P) models) take into account pronunciation variants of words. They allow to handle (very common) phonological french phenomena like liaisons or mute-e elision. The (W',P) model introduces the phonotypical level as defined in the MHAT Model [4,5]. Comparing (W,P) and (W',P) models show a significant improvement in recognition accuracy when a contextual language model is introduced at this phonotypical level.