ISCA Archive ISCSLP 2006
ISCA Archive ISCSLP 2006

Multilingual Text - Speech Corpus of Mongolian

I. Dawa, Husal, Liu Yue, Yue Yao Ming, Uulang, Bai Shuang Cheng, Batsaihan, Y. Arai, M. Mitsunaga, H. Isahara, S. Nakamura

Abstract:In this paper, we reported a multilingual parallel electronic dictionary( called MPEDMCJKE) and a multiple speech corpus(called MDSCM) of Mongolian. MPEDMCJKE is paralleled the languages of Mongolian(including the versions of Cyrillic, traditional Mongolian and Mongolian Todo used in Mongolia, China and Russia, respectively), Chinese, Japanese, Korean and English. And It is done through the international cooperation of the National Institute of Information and Communications Technology of Japan(NICT), MENKsoft Co., Ltd. and the Mongolian Information Technology Institute of Social Science of Inner Mongolia(MIT), China, and the Korea Advanced Institute of Science and Technology of Korea(KAIST). MDSCM is a multi-dialectal speech corpus of Mongolian collected from different areas or countries, and is done supported by Shirai laboratory in waseda university and ATR of Japan during 1998-2006. Keyword: Mongolian text-speech corpus, various versions and dialects, Multilingual dictionary.