ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

High-quality speech synthesis for phonetic speech segmentation

Fabrice Malfrere, Thierry Dutoit

This paper presents an original technique for solving the phonetic segmentation problem. It is based on the use of a speech synthesizer for the alignment of a text on its corresponding speech signal. A high-quality digital speech synthesizer is used to create a synthetic reference speech pattern used in the alignment process. This approach has the great advantage on other approaches that no training stage (hence no labeled database) is needed. The system has been mainly evaluated on French read utterances. Other evaluations have been made on other languages like English, German, Romanian and Spanish. Following these experiments, the system seems to be a powerful tool for the automatic constitution of large phonetically and prosodically labeled speech databases. The availability of such corpora will be a key point for the development of improved speech synthesis and recognition systems.


doi: 10.21437/Eurospeech.1997-664

Cite as: Malfrere, F., Dutoit, T. (1997) High-quality speech synthesis for phonetic speech segmentation. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2631-2634, doi: 10.21437/Eurospeech.1997-664

@inproceedings{malfrere97_eurospeech,
  author={Fabrice Malfrere and Thierry Dutoit},
  title={{High-quality speech synthesis for phonetic speech segmentation}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2631--2634},
  doi={10.21437/Eurospeech.1997-664},
  issn={1018-4074}
}