ISCA Archive ECST 1987
ISCA Archive ECST 1987

Acoustic-phonetic labels in a Japanese speech database

Kazuya Takeda, Yoshinori Sagisaka, Shigeru Katagiri

A large sized Japanese speech database at ATR(JSDB-ATR) is introduced. These speech data are transcribed in multiple ways using acoustic-phonetic symbols for various data access requests and for the convenience of fine acoustic-phonetic analysis. For multiple transcription, three types of categories are considered: linguistic and phonemic categories, acoustic event categories and some alophonic variation categories. To date, about 8500 words respectively uttered by eight professional announcers have been collected with half of them being acoustically-phonetically transcribed.