ISCA Archive Blizzard 2018

The NTUT's Text-to-Speech System for Blizzard Challenge 2018

Yuan-Fu Liao, Ya-Bo Chai, Cheng-Hung Tsai

This paper describes our first deep neural network (DNN)-based speech synthesis system, submitted to the Blizzard Challenge 2018. The focus of this work is to explore the capabilities of DNNs; our system is therefore built on the latest version (2.3.2) of the HMM/DNN-based Speech Synthesis System (HTS) toolkit. Although the system does not yet perform well enough on naturalness and similarity, its speech pause and stress scores on audiobook paragraphs were close to or above the average. This should be a good starting point for further performance improvement.