ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

Unit selection for speech synthesis using splicing costs with weighted finite state transducers

Ivan Bulyko, Mari Ostendorf

In this paper we describe how unit selection for concatenative speech synthesis can be implemented efficiently for sub-phonetic units using weighted finite state transducers (WFST). We also introduce splicing costs as a measure to indicate which unit boundaries are particularly good or poor joint points. Splicing costs extend the flexibility offered by the unit selection paradigm. Through a perceptual experiment we demonstrate an improvement in speech quality achieved by using splicing costs during unit selection.