ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Combinatorial issues in text-to-speech synthesis

Jan P. H. van Santen

Enhanced storage capacities and new learning algorithms have increased the role of text and speech training data bases in the construction of text-to-speech systems. It has become apparent, however, that not always learning algorithms are available that have strong generalization capabilities - the ability to generalize from cases seen in the training data base to new cases encountered during TTS operation. This makes it important to measure and understand the degree of coverage of the input domain of a text-to-speech system (usually, the entire language) by a given training data base. The goal of this paper is to investigate the feasibility of coverage in several domains of interest for TTS. It is shown that, as a result of the combinatorics of language, coverage is typically quite disappointing. This puts a premium on the generalization capability of learning algorithms.


doi: 10.21437/Eurospeech.1997-651

Cite as: Santen, J.P.H.v. (1997) Combinatorial issues in text-to-speech synthesis. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2507-2510, doi: 10.21437/Eurospeech.1997-651

@inproceedings{santen97c_eurospeech,
  author={Jan P. H. van Santen},
  title={{Combinatorial issues in text-to-speech synthesis}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2507--2510},
  doi={10.21437/Eurospeech.1997-651},
  issn={1018-4074}
}