ISCA Archive Interspeech 2006

Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination

Petr Cerva, Jan Nouza, Jan Silovsky

In this paper, we present a new strategy for unsupervised speaker adaptation. In our approach, the adaptation is performed in two steps for each test utterance. In the first, online step, we use speaker and gender identification, a set of speaker-dependent (SD) hidden Markov models (HMMs), and our own fast linear model combination approach to create a suitable model for the first speech recognition pass. The recognized phonetic transcription of the utterance is then used for maximum likelihood (ML) estimation of more accurate weights for the final model combination step. Our experimental results on different types of broadcast programs show that the proposed method can reduce the word error rate (WER) by more than 17% relative.
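The abstract does not say which HMM parameters are combined or how the weights are obtained, so the following is only a minimal sketch of what a linear model combination could look like. It assumes that the Gaussian mean vectors of the SD models are combined and that the weights are non-negative and normalized to sum to one. The function name `combine_means` and the example values are hypothetical and are not taken from the paper.

```python
def combine_means(sd_means, weights):
    """Linearly combine speaker-dependent mean vectors: mu = sum_k w_k * mu_k.

    Assumption (not from the paper): weights are non-negative and are
    normalized here so that they sum to one.
    """
    if len(sd_means) != len(weights):
        raise ValueError("need exactly one weight per SD model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    norm = [w / total for w in weights]
    dim = len(sd_means[0])
    # Weighted sum over the SD models, computed per feature dimension.
    return [sum(w * m[d] for w, m in zip(norm, sd_means)) for d in range(dim)]


# Hypothetical example: the same Gaussian mean taken from two SD models.
sd_means = [[0.0, 2.0], [4.0, 6.0]]
weights = [1.0, 3.0]  # e.g. rough weights available before the first pass
combined = combine_means(sd_means, weights)
```

In a two-step scheme of the kind described above, a function like this would be called twice per utterance: first with the fast first-pass weights, then with the weights re-estimated from the recognized phonetic transcription.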


doi: 10.21437/Interspeech.2006-98

Cite as: Cerva, P., Nouza, J., Silovsky, J. (2006) Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination. Proc. Interspeech 2006, paper 1441-Thu1CaP.7, doi: 10.21437/Interspeech.2006-98

@inproceedings{cerva06_interspeech,
  author={Petr Cerva and Jan Nouza and Jan Silovsky},
  title={{Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1441-Thu1CaP.7},
  doi={10.21437/Interspeech.2006-98},
  issn={2958-1796}
}