ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Speaker adaptation based on pre-clustering training speakers

Yuqing Gao, Mukund Padmanabhan, Michael Picheny

A new strategy for speaker adaptation is described that is based on: (1) pre-clustering all the speakers in the training set acoustically into clusters; (2) for each speaker cluster, a system is built using the data from the speakers who belong to the cluster; (3) when a test speaker's data is available, we find a subset of these clusters, closest to the test speaker; (4) we transform each of the selected clusters to bring it closer to the test speaker's acoustic space; (5) we build a speaker-adapted model using transformed cluster models. This method solves the problem of excessive storage for the training speaker models [1] , as it is relatively inexpensive to store a model for each cluster. Also as each cluster contains a number of speakers, parameters of the models for each cluster can be robustly estimated. The algorithm has been evaluated on a large vocabulary system and comparied to existing algorithms. The imporvement over existing algorithms such as MLLR [2] is statistically significant.


doi: 10.21437/Eurospeech.1997-553

Cite as: Gao, Y., Padmanabhan, M., Picheny, M. (1997) Speaker adaptation based on pre-clustering training speakers. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2091-2094, doi: 10.21437/Eurospeech.1997-553

@inproceedings{gao97_eurospeech,
  author={Yuqing Gao and Mukund Padmanabhan and Michael Picheny},
  title={{Speaker adaptation based on pre-clustering training speakers}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2091--2094},
  doi={10.21437/Eurospeech.1997-553},
  issn={1018-4074}
}