ISCA Archive ICSLP 1990
ISCA Archive ICSLP 1990

Speaker weighted training of HMM using multiple reference speakers

Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano, Shigeki Sagayama

This paper proposes a new speaker adaptation method using speaker weights for multiple reference speaker training. The speaker weights are calculated to reflect the similarity of each reference speaker's dynamic features to an input speaker. They are used to have the similarities affect to hidden Markov models. The evaluation experiments are carried out through the /b,d,g,m,n,N/ phoneme recognition task using 8 speakers. Average recognition rates are 68.0%, 66.4%, and 65.6% respectively for three test sets which have different speech styles, that is, word utterances, phrase-by-phrase utterances and continuous utterances. These are 1.6%, 6.7%, and 8.2% respectively higher than the supplemented HMM rates.