ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Improvements on a trainable letter-to-sound converter

Li Jiang, Hsiao-Wuen Hon, Xuedong Huang

Letter-to-sound (LTS) conversion is important for both text-to-speech (TTS) and automatic speech recognition (ASR). In this paper we discuss some improvements we have made on our trainable LTS converter. We use a classification and regression tree (CART) to automatically configure the most salient phonological rules needed for the LTS conversion. We address problems in growing multiple trees and use of phonotactic information for better generalization. The experiments were carried on both the NETTALK database and the CMU dictionary. With improved techniques, the conversion error rate at the phoneme level and word level was reduced by 15% and 20% respectively. For both tasks, the phoneme conversion error rate was reduced to about 8%.


doi: 10.21437/Eurospeech.1997-220

Cite as: Jiang, L., Hon, H.-W., Huang, X. (1997) Improvements on a trainable letter-to-sound converter. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 605-608, doi: 10.21437/Eurospeech.1997-220

@inproceedings{jiang97_eurospeech,
  author={Li Jiang and Hsiao-Wuen Hon and Xuedong Huang},
  title={{Improvements on a trainable letter-to-sound converter}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={605--608},
  doi={10.21437/Eurospeech.1997-220},
  issn={1018-4074}
}