ISCA Archive AVSP 2007

Audiovisual Lombard speech: reconciling production and perception

Eric Vatikiotis-Bateson, Adriano V. Barbosa, Cheuk Yi Chow, Martin Oberg, Johanna Tan, Hani C. Yehia

An earlier study compared audiovisual perception of speech ‘produced in environmental noise’ (Lombard speech) and speech ‘produced in quiet’ with the same environmental noise added. The results showed that listeners make differential use of the visual information depending on the recording condition, but gave no indication of how or why this might be so. A possible confound in that study was that high audio presentation levels might account for the small visual enhancements observed for Lombard speech. This paper reports results for a second perception study using much lower acoustic presentation levels, compares them with the results of the previous study, and integrates the perception results with analyses of the audiovisual production data: face and head motion, audio amplitude (RMS), and parameters of the spectral acoustics (line spectrum pairs).