ISCA Archive SpeechProsody 2014
ISCA Archive SpeechProsody 2014

The OMe (Octave-Median) scale: a natural scale for speech melody.

Céline De Looze, Daniel Hirst

Fundamental frequency, the primary acoustic correlate of speech melody, is generally analysed and displayed using a linear scale (Hertz) or a logarithmic one, generally in semitones and usually offset to an arbitrary reference level such as 100 Hz. In this paper we argue that a more natural scale for analysing speech is the OME (Octave-MEdian) scale, using the octave (o) as the basic unit, offset to the median value of the speaker’s range. We present results showing that a reasonable estimate of a speaker’s pitch range can be obtained directly from the median.