ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Design and analysis of a German telephone speech database for phoneme based training

Stefan Feldes, Bernhard Kaspar, Denis Jouvet

Based on the Sotscheck text corpus, we developped a new corpus that was specifically optimised for training phoneme-based recognition systems. Particular attention was payed on good coverage of phone transitions. Even though the resulting corpus is only slightly enlarged, it shows an increased phonetic coverage while maintaining a good phonetic balance. Results of phonetic statistical analysis and of experiments for training an allophone-based recognizer are reported here.