ISCA Archive Interspeech 2012

Combining temporal and cepstral features for the automatic perceptual categorization of disordered connected speech

Ali Alpan, Jean Schoentgen, Francis Grenez

This presentation reports experiments on the automatic classification of disordered connected speech into multiple perceptual categories (modal, moderately hoarse, severely hoarse). Support vector machines perform the classification and are fed with temporal signal-to-dysperiodicity ratios, the first rahmonic amplitude, and mel-frequency cepstral coefficients. The signal-to-dysperiodicity ratio complements the first rahmonic amplitude when voice samples are categorized according to the degree of hoarseness, yielding 77% correct classification.

Index Terms: automatic perceptual categorization of disordered connected speech, variogram analysis, signal-to-dysperiodicity ratio, first rahmonic amplitude, mel-frequency cepstral coefficients, support vector machine
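
The abstract does not say how the first rahmonic amplitude is computed. As an illustration only, the following minimal sketch uses the generic textbook definition: the largest peak of the real cepstrum within the quefrency range of plausible vocal periods. The function name first_rahmonic, the Hann window, the 60-400 Hz search range and the eps floor are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def first_rahmonic(frame, fs, f0_min=60.0, f0_max=400.0, eps=1e-12):
    """Return (amplitude, quefrency in samples) of the first rahmonic.

    Hypothetical helper: generic real-cepstrum peak picking, not
    necessarily the procedure used by the authors.
    """
    frame = np.asarray(frame, dtype=float)
    # Taper the frame to limit spectral leakage (assumed choice of window).
    windowed = frame * np.hanning(len(frame))
    # Real cepstrum: inverse FFT of the log-magnitude spectrum.
    # eps avoids log(0) for empty spectral bins.
    log_mag = np.log(np.abs(np.fft.fft(windowed)) + eps)
    cepstrum = np.fft.ifft(log_mag).real
    # Quefrency search range corresponding to f0_max .. f0_min.
    q_lo = int(fs / f0_max)
    q_hi = min(int(fs / f0_min), len(frame) // 2)
    # The first rahmonic is taken as the largest cepstral value in that range.
    q_peak = q_lo + int(np.argmax(cepstrum[q_lo:q_hi + 1]))
    return cepstrum[q_peak], q_peak

if __name__ == "__main__":
    # Synthetic check: an impulse train with a period of 80 samples,
    # i.e. a 100 Hz "voice" sampled at 8 kHz.
    fs = 8000
    frame = np.zeros(800)
    frame[::80] = 1.0
    amplitude, quefrency = first_rahmonic(frame, fs)
    print(amplitude, quefrency, fs / quefrency)
```

In a pipeline of the kind described above, such a per-frame amplitude would be one entry of the feature vector passed to the classifier, alongside the signal-to-dysperiodicity ratio and the mel-frequency cepstral coefficients. A strongly periodic (modal) voice is expected to give a prominent rahmonic, whereas hoarse voices tend to give a weaker one.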