ISCA Archive Interspeech 2006

Feature normalization using smoothed mixture transformations

Patrick Kenny, Vishwa Gupta, G. Boulianne, Pierre Ouellet, Pierre Dumouchel

We propose a method for estimating the parameters of SPLICE-like transformations from individual utterances, so that such transformations can be used to normalize acoustic feature vectors for speech recognition on an utterance-by-utterance basis, in the same way as cepstral mean normalization. We report results on an in-house French-language multi-speaker database collected while deploying an automatic closed-captioning system for live broadcast news. An unusual feature of this database is the very large amount of training data available for each individual speaker (typically several hours), which makes it very difficult to improve on multi-speaker modeling with standard speaker adaptation methods. We found that the proposed method of feature normalization can achieve a 6% relative improvement over cepstral mean normalization on this task.
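
For readers unfamiliar with the baseline, the following is a minimal sketch of per-utterance cepstral mean normalization, the technique the abstract compares against. It is not the authors' code and does not implement the proposed smoothed mixture transformations. The function name `cepstral_mean_normalize` and the toy two-dimensional frames are assumptions made for illustration only.

```python
from typing import List


def cepstral_mean_normalize(utterance: List[List[float]]) -> List[List[float]]:
    """Subtract the per-utterance mean of each cepstral coefficient.

    `utterance` is a list of frames; each frame is a list of coefficients.
    (Hypothetical helper, not taken from the paper.)
    """
    if not utterance:
        return []
    num_frames = len(utterance)
    dim = len(utterance[0])
    # Mean of each coefficient, computed over this utterance only.
    means = [sum(frame[d] for frame in utterance) / num_frames for d in range(dim)]
    # Shift every frame so each coefficient has zero mean over the utterance.
    return [[frame[d] - means[d] for d in range(dim)] for frame in utterance]


# Toy utterance with two frames of two coefficients each (made-up values).
frames = [[1.0, 4.0], [3.0, 8.0]]
normalized = cepstral_mean_normalize(frames)
```

Because the statistics are computed from the utterance being normalized, no speaker-specific training data is needed; this is the property the abstract says the proposed method shares.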

doi: 10.21437/Interspeech.2006-7

Cite as: Kenny, P., Gupta, V., Boulianne, G., Ouellet, P., Dumouchel, P. (2006) Feature normalization using smoothed mixture transformations. Proc. Interspeech 2006, paper 1026-Mon1A2O.1, doi: 10.21437/Interspeech.2006-7

  author={Patrick Kenny and Vishwa Gupta and G. Boulianne and Pierre Ouellet and Pierre Dumouchel},
  title={{Feature normalization using smoothed mixture transformations}},
  booktitle={Proc. Interspeech 2006},
  pages={paper 1026-Mon1A2O.1},