ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

Using aerial and geometric features in automatic lip-reading

Jacek C. Wojdel, Leon J. M. Rothkrantz

In this paper we present the lip-reading experiments with different sets of the features extracted from the video sequence. In our experiments we use a simple color based filtering techniques to extract the feature vectors from the incoming video signal. Some of those features are directly related to the geometrical properties of the lips (their position and visible thickness). Other features represent the information that relates to the visibility of the other components of the speech production system. The visibility of the teeth and vocal tract for example is described by means of the area they occupy in the image, we call them therefore the aerial features.