ISCA Archive Interspeech 2010
ISCA Archive Interspeech 2010

Incorporating MAP estimation and covariance transform for SVM based speaker recognition

Cheung-Chi Leung, Donglai Zhu, Kong Aik Lee, Bin Ma, Haizhou Li

In this paper, we apply Constrained Maximum a Posteriori Linear Regression (CMAPLR) transformation on Universal Background Model (UBM) when characterizing each speaker with a supervector. We incorporate the covariance transformation parameters into the supervector in addition to the mean transformation parameters. Maximum Likelihood Linear Regression (MLLR) covariance transformation is adopted. The auxiliary function maximization involved in Maximum Likelihood (ML) and Maximum a Posteriori (MAP) estimation is also presented. Our experiment on the 2006 NIST Speaker Recognition Evaluation (SRE) corpus shows that the two proposed techniques provide substantial performance improvement.