ISCA Archive Interspeech 2014

Towards a perceptual model of speech rhythm: integrating the influence of f0 on perceived duration

Robert Fuchs

Previous accounts of speech rhythm focus mainly on duration. For example, the normalised Pairwise Variability Index for vocalic intervals (nPVI-V) quantifies relative duration differences between successive vocalic intervals. Prototypical syllable-timing is characterised by small differences in duration, prototypical stress-timing by large differences. However, differences in F0 between vocalic intervals are thought to influence the perception of duration. This paper (1) quantifies the influence of differences in F0 on perceived duration in a perception experiment, and (2) suggests a modified PVI (nPVI-V(dur*F0)) that takes account of this influence. The new nPVI-V(dur*F0) is then applied to a speech corpus of (stress-timed) British English and (syllable-timed) Indian English. The results are compared to the application of the old nPVI-V, which takes into account duration only, to the same data set.
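
As an illustration of the duration-only measure mentioned above, the following is a minimal sketch of the nPVI as it is commonly defined in the rhythm-metrics literature: the mean absolute difference between successive interval durations, each normalised by the mean of the pair, multiplied by 100. The function name `npvi` and the example durations are assumptions made for illustration, not taken from the paper. The F0 weighting behind nPVI-V(dur*F0) is not specified in this abstract, so it is deliberately not sketched here.

```python
from typing import Sequence


def npvi(durations: Sequence[float]) -> float:
    """Duration-only normalised Pairwise Variability Index.

    `durations` holds successive vocalic interval durations in any
    consistent unit (e.g. milliseconds). Hypothetical helper for
    illustration only.
    """
    if len(durations) < 2:
        raise ValueError("nPVI needs at least two intervals")

    total = 0.0
    # Compare each interval with its immediate successor.
    for current, following in zip(durations, durations[1:]):
        pair_mean = (current + following) / 2
        total += abs(current - following) / pair_mean

    # Average over the m - 1 successive pairs and scale by 100.
    return 100 * total / (len(durations) - 1)


if __name__ == "__main__":
    # Made-up durations: equal intervals versus alternating long/short.
    print(npvi([100, 100, 100, 100]))
    print(npvi([50, 150, 50, 150]))
```

Equal durations give a value of 0, in line with the small differences the abstract associates with prototypical syllable-timing, while strongly alternating durations give a larger value, as expected for prototypical stress-timing.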


doi: 10.21437/Interspeech.2014-440

Cite as: Fuchs, R. (2014) Towards a perceptual model of speech rhythm: integrating the influence of f0 on perceived duration. Proc. Interspeech 2014, 1949-1953, doi: 10.21437/Interspeech.2014-440

@inproceedings{fuchs14_interspeech,
  author={Robert Fuchs},
  title={{Towards a perceptual model of speech rhythm: integrating the influence of f0 on perceived duration}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={1949--1953},
  doi={10.21437/Interspeech.2014-440},
  issn={2308-457X}
}