ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

UBM fused total variability modeling for language identification

Maarten van Segbroeck, Ruchir Travadi, Shrikanth S. Narayanan

This paper proposes Universal Background Model (UBM) fusion in the framework of total variability or i-vector modeling with the application to language identification (LID). The total variability subspace which is typically exploited to discriminate between the language classes of different speech recordings, is trained by combining the normalized Baum-Welch statistics of multiple UBMs. When the UBMs model a diverse set of feature representations, the method yields an i-vector representation which is more discriminant between the classes of interest. This approach is particularly useful when applied to short-duration utterances, and is a computationally less complex alternative to performance boosting as compared to system level fusion. We assess the performance of UBM fused total variability modeling on the task of robust language identification on short-duration utterances, as part of Phase-III of the DARPA RATS (Robust Automatic Transcription of Speech) program.


doi: 10.21437/Interspeech.2014-607

Cite as: Segbroeck, M.v., Travadi, R., Narayanan, S.S. (2014) UBM fused total variability modeling for language identification. Proc. Interspeech 2014, 3027-3031, doi: 10.21437/Interspeech.2014-607

@inproceedings{segbroeck14b_interspeech,
  author={Maarten van Segbroeck and Ruchir Travadi and Shrikanth S. Narayanan},
  title={{UBM fused total variability modeling for language identification}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={3027--3031},
  doi={10.21437/Interspeech.2014-607},
  issn={2308-457X}
}