ISCA Archive Eurospeech 1997

On using fractal features of speech sounds in automatic speech recognition

Petros Maragos, Alexandros Potamianos

The dynamics of airflow during speech production may often result in some small or large degree of turbulence. In this paper, we quantify the geometry of speech turbulence, as reflected in the fragmentation of the time signal, by using fractal models. We describe an efficient algorithm for estimating the short-time fractal dimension of speech signals based on multiscale morphological filtering and discuss its potential for phonetic classification. We also report experimental results on using the short-time fractal dimension of speech signals at multiple scales as additional features in an automatic speech recognition system based on hidden Markov models (HMMs), and find that these features provide a modest improvement in speech recognition performance.
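As an illustration of the kind of estimator the abstract describes, the following is a minimal sketch of a morphological covering estimate of fractal dimension, written under stated assumptions rather than as the authors' exact algorithm. It assumes flat structuring elements of half-width s, a cover area A(s) taken as the sum of dilation minus erosion, and a least-squares fit of log(A(s)/s^2) against log(1/s). The function names, the default `max_scale`, the edge padding, and the frame and hop lengths are all hypothetical choices made for this example.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def flat_dilate(x, s):
    # Moving maximum over the window [n - s, n + s] (flat structuring element).
    padded = np.pad(x, s, mode="edge")  # edge padding is an assumption
    return sliding_window_view(padded, 2 * s + 1).max(axis=1)


def flat_erode(x, s):
    # Moving minimum over the window [n - s, n + s].
    padded = np.pad(x, s, mode="edge")
    return sliding_window_view(padded, 2 * s + 1).min(axis=1)


def fractal_dimension(x, max_scale=10):
    # Covering-style estimate: the area between the dilated and eroded
    # signal at scale s is assumed to behave like A(s) ~ s**(2 - D).
    x = np.asarray(x, dtype=float)
    scales = np.arange(1, max_scale + 1)
    areas = np.array(
        [np.sum(flat_dilate(x, s) - flat_erode(x, s)) for s in scales]
    )
    if np.any(areas <= 0):
        return float("nan")  # degenerate frame, e.g. a constant signal
    # D is the slope of log(A(s) / s^2) versus log(1 / s).
    slope, _ = np.polyfit(np.log(1.0 / scales), np.log(areas / scales**2), 1)
    return float(slope)


def short_time_fractal_dimension(x, frame_len, hop, max_scale=10):
    # One estimate per frame; frame_len and hop are in samples.
    x = np.asarray(x, dtype=float)
    starts = range(0, len(x) - frame_len + 1, hop)
    return np.array(
        [fractal_dimension(x[i:i + frame_len], max_scale) for i in starts]
    )
```

With this sketch, a smooth signal such as a linear ramp should yield an estimate near 1, while white noise should yield a clearly larger value. Restricting the fit to different ranges of s is one possible way to obtain dimension estimates "at multiple scales", though the exact feature definition used in the paper is not stated in the abstract.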


doi: 10.21437/Eurospeech.1997-657

Cite as: Maragos, P., Potamianos, A. (1997) On using fractal features of speech sounds in automatic speech recognition. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2531-2534, doi: 10.21437/Eurospeech.1997-657

@inproceedings{maragos97_eurospeech,
  author={Petros Maragos and Alexandros Potamianos},
  title={{On using fractal features of speech sounds in automatic speech recognition}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2531--2534},
  doi={10.21437/Eurospeech.1997-657},
  issn={1018-4074}
}