ISCA Archive Eurospeech 2003
ISCA Archive Eurospeech 2003

Dependence of GMM adaptation on feature post-processing for speaker recognition

Robbie Vogt, Jason Pelecanos, Sridha Sridharan

This paper presents a study on the relationship between feature post-processing and speaker modelling techniques for robust text-independent speaker recognition. A fully coupled target and background Gaussian mixture speaker model structure is used for hypothesis testing in this speaker model based recognition system. Two formulations of the Maximum a Posteriori (MAP) adaptation algorithm for Gaussian mixture models are considered. We contrast the standard single iteration adaptation algorithm to adaptation using multiple iterations. Three post-processing techniques for cepstral features are considered; feature warping, cepstral mean subtraction (CMS) and RelAtive SpecTrA (RASTA) processing. It is shown that the advantage gained through iterative MAP adaptation is dependent on the parameterisation technique used. Reasons for this dependency are discussed.