ISCA Archive Interspeech 2015

Phonemes frequency based PLLR dimensionality reduction for language recognition

Saad Irtza, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah, Haizhou Li

This paper presents a new approach to reducing the dimensionality of Phone Log-Likelihood Ratio (PLLR) features, which have been shown to be effective for language recognition, by removing the likelihoods corresponding to less frequent phonemes. In this work, phoneme frequencies are estimated using a suitable phoneme recogniser. Following this, an i-vector framework is used to represent the total variability in the reduced-dimensional PLLR feature space. This paper also proposes the use of Gaussian probabilistic linear discriminant analysis (GPLDA) as a back-end for Language Recognition Evaluation (LRE) tasks. The suitability of both the proposed dimensionality reduction technique and the GPLDA back-end has been evaluated on the NIST 2007 and 2011 LRE tasks. The results show that the novel dimensionality reduction method outperforms PCA-based dimensionality reduction by 7%. Further, the results also show that GPLDA outperforms generatively trained Gaussian back-ends, which have previously been used in conjunction with PLLR features, by 14.6%.
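
As an illustration of the frequency-based reduction described in the abstract, the following is a minimal Python sketch and not the authors' implementation. It assumes frame-level phone posteriors are already available from a phoneme recogniser, computes PLLR features as the logit of each posterior, and approximates phoneme frequency by counting the most likely phoneme per frame. That counting rule, the function names, the toy posterior matrix and the choice of k are all assumptions made for this example; the abstract does not specify them.

```python
import numpy as np

def pllr_from_posteriors(post, eps=1e-10):
    """Convert phone posteriors (frames x phonemes) to PLLR features.

    Each PLLR value is log(p / (1 - p)) for the corresponding posterior p.
    """
    p = np.clip(post, eps, 1.0 - eps)  # avoid log(0)
    return np.log(p) - np.log1p(-p)

def phoneme_frequencies(post_list):
    """Estimate relative phoneme frequencies over a list of utterances.

    Assumption: the frame-wise argmax of the posteriors stands in for
    the decoded phoneme sequence of a real recogniser.
    """
    n_phones = post_list[0].shape[1]
    counts = np.zeros(n_phones)
    for post in post_list:
        counts += np.bincount(post.argmax(axis=1), minlength=n_phones)
    return counts / counts.sum()

def select_frequent(freqs, k):
    """Return the indices of the k most frequent phonemes, in ascending order."""
    top = np.argsort(freqs)[::-1][:k]
    return np.sort(top)

# Toy data (hypothetical): 4 frames, 3 phonemes.
post = np.array([
    [0.7, 0.2, 0.1],
    [0.6, 0.1, 0.3],
    [0.1, 0.1, 0.8],
    [0.5, 0.3, 0.2],
])

freqs = phoneme_frequencies([post])   # estimated on "training" data
keep = select_frequent(freqs, k=2)    # dimensions to retain
pllr = pllr_from_posteriors(post)
reduced = pllr[:, keep]               # drop the less frequent phonemes
```

In a full system the retained dimensions would be chosen once on training data and then applied unchanged to every utterance before i-vector extraction.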