ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

A correlational discriminant approach to feature extraction for robust speech recognition

Vikrant Singh Tomar, Richard C. Rose

A non-linear discriminant analysis based approach to feature space dimensionality reduction in noise robust automatic speech recognition (ASR) is proposed. It utilizes a correlation based distance measure instead of the conventional Euclidean distance. The use of this "correlation preserving discriminant analysis" (CPDA) procedure is motivated by evidence suggesting that correlation based cepstrum distance measures can be more robust than Euclidean based distances when speech is corrupted by noise. The performance of CPDA is evaluated in terms of the word error rate obtained by using CPDA derived features on the Aurora 2 speech in noise corpus, and is compared to the commonly used linear discriminant analysis (LDA) approach to feature space transformations.

Index Terms: Correlation preserving discriminant analysis, graph embedding, dimensionality reduction, speech recognition


doi: 10.21437/Interspeech.2012-171

Cite as: Tomar, V.S., Rose, R.C. (2012) A correlational discriminant approach to feature extraction for robust speech recognition. Proc. Interspeech 2012, 555-558, doi: 10.21437/Interspeech.2012-171

@inproceedings{tomar12_interspeech,
  author={Vikrant Singh Tomar and Richard C. Rose},
  title={{A correlational discriminant approach to feature extraction for robust speech recognition}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={555--558},
  doi={10.21437/Interspeech.2012-171},
  issn={2958-1796}
}