ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Hybridizing conversational and clear speech

Akiko Kusumoto, Alexander B. Kain, John-Paul Hosom, Jan P. H. van Santen

"Clear" ( clr) speech is a speaking style that speakers adopt to be understood correctly in a difficult communication environment. Studies have shown that clr speech, as opposed to "conversational" ( cnv) speech, has significantly higher intelligibility in various conditions. While many differences in acoustic features have been identified, it is not known which individual feature or combinations of features cause the higher intelligibility of clr speech. The objectives of the current study are to examine whether it is possible to improve speech intelligibility by approximating clr speech features and to determine which acoustic features contribute to intelligibility. Our approach creates speech samples that combine acoustic features of cnv and clr speech, using a hybridization algorithm. Results with normal-hearing listeners showed significant sentence-level intelligibility improvements of 11-23% over cnv speech when replacing certain acoustic features with those from clr speech.