ISCA Archive AVSP 1999

On the use of visual information for improving audio-based speaker recognition

Andrew Senior, Chalapathy V. Neti, Benoit Maison

Audio-based speaker identification degrades severely when training and test conditions are mismatched, due to either channel or noise. In this paper, we explore various techniques for fusing video-based speaker identification with audio-based speaker identification to improve performance under mismatched conditions.