ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Delta vector taylor series environment compensation for speaker recognition

Brian Eberman, Pedro J. Moreno

The performance of speaker recognition algorithms drops significantly when testing and training acoustic environments differ. This decrease is caused by the statistical mismatch between the statistics representing the speaker and the testing acoustic data. This paper reports our preliminary results on the application of a novel environmental compensation algorithm to the problem of speaker recognition and identification. This new technique, called the Delta Vector Taylor Series (DVTS) approach, improves performance at signal-to-noise ratios below 20dB. The algorithm imposes a model of how the envi- ronment modifies speaker statistics and uses Expectation- Maximization (EM) to solve a joint maximum likelihood formulation for the speaker recognition problem over both the speakers and the environment. We report experimental results on a subset of the TIMIT and NTIMIT database.


doi: 10.21437/Eurospeech.1997-614

Cite as: Eberman, B., Moreno, P.J. (1997) Delta vector taylor series environment compensation for speaker recognition. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2335-2338, doi: 10.21437/Eurospeech.1997-614

@inproceedings{eberman97_eurospeech,
  author={Brian Eberman and Pedro J. Moreno},
  title={{Delta vector taylor series environment compensation for speaker recognition}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2335--2338},
  doi={10.21437/Eurospeech.1997-614},
  issn={1018-4074}
}