ISCA Archive Eurospeech 1995

The Dutch polyphone corpus

E. A. den Os, T. I. Boogaart, Lou Boves, Esther Klabbers

This paper first summarizes the work done to design, record, transcribe, and produce the Dutch Polyphone corpus. It then presents figures on the frequency of occurrence of diphones and triphones in the phonetically rich sentences. Furthermore, we describe one way of using the corpus, viz. to derive information about the way callers pronounced telephone numbers, postal codes (both read and spontaneously spoken), amounts of money, times, and bank account numbers. Finally, we show that hesitation phenomena occur most frequently within longer items.
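
As an illustration of the kind of diphone and triphone frequency figures mentioned above, the following is a minimal sketch, not the authors' procedure. It assumes each sentence is already available as a list of phone symbols, and it simplifies a diphone to a pair of adjacent phones and a triphone to three adjacent phones. The function name `ngram_counts` and the sample phone sequences are hypothetical and are not taken from the corpus.

```python
from collections import Counter


def ngram_counts(sentences, n):
    """Count runs of n adjacent phone symbols within each sentence.

    N-grams never span a sentence boundary.
    """
    counts = Counter()
    for phones in sentences:
        for i in range(len(phones) - n + 1):
            counts[tuple(phones[i:i + n])] += 1
    return counts


# Hypothetical phone sequences for illustration only (not corpus data).
sentences = [
    ["d", "a", "x"],
    ["d", "a", "k"],
]

diphones = ngram_counts(sentences, 2)   # adjacent phone pairs
triphones = ngram_counts(sentences, 3)  # adjacent phone triples

# List the units from most to least frequent.
for unit, freq in diphones.most_common():
    print("-".join(unit), freq)
```

Counting within sentences only keeps the figures tied to the sentence material itself; whether the original figures were computed this way is an assumption.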