ISCA Archive Interspeech 2010

Speaking style dependency of formant targets

Akiko Amano-Kusumoto, John-Paul Hosom, Alexander Kain

Previous work on formant targets has assumed that these targets are independent of the speaking style. In this paper, we estimate consonant and vowel targets in a database of “clear” and “conversational” speech, using both style-independent and style-dependent models. The test-set errors and the clustering of the estimated target values indicate that, for this corpus, formant targets depend on the speaking style. As an application, we tested vowel classification accuracy in both style-independent and style-dependent conditions, using observed formant values and estimated target values. Token-based style-independent classification shows greater accuracy for conversational speech (82.19%) than observed-value classification (73.97%).