ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Maximising objective speech intelligibility by local F0 modulation

Julián Villegas, Martin Cooke

We investigated the effect on objective speech intelligibility of scaling the fundamental frequency (F0) of voiced regions in a set of utterances. The frequency scaling was driven by max- imising the glimpse proportion in voiced epochs, inspired by musical consonance maximisation techniques. Results show that depending on the energetic masker and the signal to noise ratio, F0 modifications increased the mean glimpse proportion by up to 15 %. On average, lower mean F0 changes resulted in greater glimpse proportions. It was also found that the glimpse proportion could be a good predictor of music consonance.

Index Terms: roughness, glimpse proportion, objective speech intelligibility, musical consonance, fundamental frequency