ISCA Archive Eurospeech 1997

Using formant frequencies in speech recognition

John N. Holmes, Wendy J. Holmes, Philip N. Garner

Formant frequencies have rarely been used as acoustic features for speech recognition, in spite of their phonetic significance. For some speech sounds one or more of the formants may be so badly defined that it is not useful to attempt a frequency measurement. Also, it is often difficult to decide which formant labels to attach to particular spectral peaks. This paper describes a new method of formant analysis which includes techniques to overcome both of the above difficulties. Using the same data and HMM model structure, results are compared between a recognizer using conventional cepstrum features and one using three formant frequencies, combined with fewer cepstrum features to represent general spectral trends. For the same total number of features, results show that including formant features can offer increased accuracy over using cepstrum features only.


doi: 10.21437/Eurospeech.1997-551

Cite as: Holmes, J.N., Holmes, W.J., Garner, P.N. (1997) Using formant frequencies in speech recognition. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2083-2086, doi: 10.21437/Eurospeech.1997-551

@inproceedings{holmes97_eurospeech,
  author={John N. Holmes and Wendy J. Holmes and Philip N. Garner},
  title={{Using formant frequencies in speech recognition}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2083--2086},
  doi={10.21437/Eurospeech.1997-551},
  issn={1018-4074}
}