ISCA Archive Eurospeech 1993
ISCA Archive Eurospeech 1993

TELEMACO - a real time keyword spotting application for voice dialling

Eduardo Lleida, Jose B. Marino, Arturo Moreno

The problem of detecting a given set of words in fluent speech is one of the most interesting topics in speech recognition for practical real time applications. This paper present the TELEMACO system for automatic voice dialling which is based on the use of the keyword spotting technology to detect the dialling commands in fluent speech used by the IBERCOM Spanish telephone system. The user interface is based on a PC computer with a DSP board. The DSP board runs the speech recognition task and the interaction with the telephone line. The keyword vocabulary is composed by commands to dial, answer, hang-up, cancel, recall, store, etc. Each keyword is modeled by means of a discrete Hidden Markov Model. To model the non-keyword speech, syllabic fillers models and background models are used. The keyword spotting algorithm is a null grammar time-synchronous Viterbi search with two search spaces. The first search is over all the models (keywords and fillers) and the second search is only over the filler model. Thus, we can compare the behaviour of the filler model with the candidate keyword for each detection and decide if the keyword has been uttered or not. This process is done frame by frame. When a keyword is detected, the DSP board send the recognition word to the PC to take the corresponding action. The system has been implemented in a Windows environment.

Keywords: Keyword spotting, hidden Markov models, filler models, voice dialling