ISCA Archive Eurospeech 1997

Pronunciation modeling applied to automatic segmentation of spontaneous speech

Andreas Kipp, Maria-Barbara Wesenick, Florian Schiel

This paper presents two models of pronunciation. The first is based on a rule set compiled by an expert; the second is statistical, exploiting a survey of the pronunciation variants that occur in training data. Both models generate pronunciation variants from the canonical forms of words. The two models are evaluated by applying them to automatic segmentation of speech and comparing the results with manual segmentations of the same speech data. Results show that correspondence between manual and automatic segmentations can be significantly improved if pronunciation variants are taken into account. The statistical model outperforms the rule-based model.
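To make the idea of generating variants from a canonical form concrete, the following is a minimal sketch of a rule-based generator. It is not the rule set or the algorithm used in the paper: the two rewrite rules, the SAMPA-like phoneme symbols, and the names `RULES` and `generate_variants` are all assumptions chosen for illustration.

```python
# Hypothetical rewrite rules (NOT taken from the paper):
# each rule maps a phoneme sequence to its replacement.
RULES = [
    (("@", "n"), ("n",)),  # assumed rule: schwa deletion before /n/
    (("t",), ()),          # assumed rule: /t/ deletion
]


def generate_variants(canonical, rules=RULES):
    """Return every form reachable by applying the rules zero or more times.

    Every rule above shortens the form, so the search always terminates.
    """
    start = tuple(canonical)
    seen = {start}
    frontier = [start]
    while frontier:
        form = frontier.pop()
        for src, dst in rules:
            n = len(src)
            # Try the rule at every position where its left-hand side matches.
            for i in range(len(form) - n + 1):
                if form[i:i + n] == src:
                    variant = form[:i] + dst + form[i + n:]
                    if variant not in seen:
                        seen.add(variant)
                        frontier.append(variant)
    return seen


if __name__ == "__main__":
    for variant in sorted(generate_variants(["h", "a:", "b", "@", "n"])):
        print(" ".join(variant))
```

In a segmentation setting, each generated variant would be a candidate phoneme sequence for alignment against the speech signal. A statistical model could instead attach probabilities to rules or variants estimated from training data, which this sketch does not attempt.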


doi: 10.21437/Eurospeech.1997-358

Cite as: Kipp, A., Wesenick, M.-B., Schiel, F. (1997) Pronunciation modeling applied to automatic segmentation of spontaneous speech. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1023-1026, doi: 10.21437/Eurospeech.1997-358

@inproceedings{kipp97_eurospeech,
  author={Andreas Kipp and Maria-Barbara Wesenick and Florian Schiel},
  title={{Pronunciation modeling applied to automatic segmentation of spontaneous speech}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1023--1026},
  doi={10.21437/Eurospeech.1997-358},
  issn={1018-4074}
}