ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Probabilistic speaker-class based acoustic modeling for large vocabulary continuous speech recognition

Xiangang Li, Dan Su, Zaihu Pang, Xihong Wu

In this paper, a probabilistic speaker-class (PSC) based acoustic modeling method is proposed for taking into account speaker variability influence in HMM-based speech recognition systems. Firstly, within the context of speaker-class based speech recognition, an experiment is conducted to investigate the performance of speaker-class recognition based on hard-cut speaker clustering. Then, in the proposed method, through introducing the probabilistic latent speaker analysis, the speaker-class dependent acoustic models are trained based on a softdecision speaker clustering method, and combined by the distribution of speaker-class in the decoding phase. The experiments were conducted on a 600-hour speech corpus, and showed improvement in a large vocabulary continuous speech recognition task.

Index Terms: speech recognition, probabilistic latent speaker analysis, speaker clustering, speaker-class