ISCA Archive ICSLP 1996

Speaker independent bimodal phonetic recognition experiments

Piero Cosi, E. Magno Caldognetto, Franco Ferrero, M. Dugatto, K. Vagges

A speaker-independent bimodal phonetic classification experiment on the Italian plosive consonants is described. The phonetic classification scheme is based on a feed-forward recurrent back-propagation neural network working on audio and visual information. The speech signal is processed by an auditory model producing spectral-like parameters, while the visual signal is processed by specialized hardware, called ELITE, which computes lip and jaw kinematic parameters.
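
As a rough illustration of how a recurrent network can work on audio and visual information together, the following minimal sketch concatenates per-frame audio and visual feature vectors and passes them through a simple recurrent layer. It is not the authors' implementation: frame-level concatenation (early fusion), the Elman-style layer, the feature dimensions, the random untrained weights, and the names fuse_frames and recurrent_forward are all assumptions made for illustration. Six output classes are used for the six Italian plosives /p, t, k, b, d, g/.

```python
import numpy as np


def fuse_frames(audio, visual):
    """Concatenate per-frame audio and visual features (assumed early fusion)."""
    if audio.shape[0] != visual.shape[0]:
        raise ValueError("audio and visual streams need the same number of frames")
    return np.concatenate([audio, visual], axis=1)


def recurrent_forward(frames, w_in, w_rec, w_out):
    """Run an Elman-style recurrent layer; return per-frame class probabilities."""
    hidden = np.zeros(w_rec.shape[0])
    outputs = []
    for x in frames:
        # The hidden state depends on the current frame and the previous state.
        hidden = np.tanh(w_in @ x + w_rec @ hidden)
        logits = w_out @ hidden
        exp = np.exp(logits - logits.max())  # numerically stable softmax
        outputs.append(exp / exp.sum())
    return np.array(outputs)


# Hypothetical dimensions: 16 spectral-like audio parameters and
# 6 lip/jaw kinematic parameters per frame (not taken from the paper).
rng = np.random.default_rng(0)
n_frames, n_audio, n_visual, n_hidden, n_classes = 20, 16, 6, 8, 6

audio = rng.normal(size=(n_frames, n_audio))    # stand-in for auditory-model output
visual = rng.normal(size=(n_frames, n_visual))  # stand-in for ELITE output

frames = fuse_frames(audio, visual)

# Random, untrained weights; a real system would learn these by back-propagation.
w_in = rng.normal(scale=0.1, size=(n_hidden, n_audio + n_visual))
w_rec = rng.normal(scale=0.1, size=(n_hidden, n_hidden))
w_out = rng.normal(scale=0.1, size=(n_classes, n_hidden))

posteriors = recurrent_forward(frames, w_in, w_rec, w_out)

# Average the frame-level probabilities to pick one class for the whole token.
predicted = int(posteriors.mean(axis=0).argmax())
```

Because the weights are random, the predicted class carries no meaning here; the sketch only shows the data flow from two feature streams to a single class decision.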