Automatic detection of inhalation breath pauses for improved pause modelling in HMM-TTS
Norbert Braunschweiler, Langzhou Chen
Role of pausing in text-to-speech synthesis for simultaneous interpretation
Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Alistair Conkie
Minimum error rate training for phrasing in speech synthesis
Alok Parlikar, Alan W. Black
HMM-based speech synthesis of live sports commentaries: integration of a two-layer prosody annotation
Benjamin Picart, Sandrine Brognaux, Thomas Drugman
Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric
Tatsuo Inukai, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
Text to speech in new languages without a standardized orthography
Sunayana Sitaram, Gopala Krishna Anumanchipalli, Justin Chiu, Alok Parlikar, Alan W. Black
Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis
Oliver Watts, Adriana Stan, Robert A. J. Clark, Yoshitaka Mamiya, Mircea Giurgiu, Junichi Yamagishi, Simon King
A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices
Mauro Nicolao, Fabio Tesser, Roger K. Moore
Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise
Cassia Valentini-Botinhao, Mirjam Wester, Junichi Yamagishi, Simon King
Noise robustness in HMM-TTS speaker adaptation
Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, Simon King
New method for rapid vocal tract length adaptation in HMM-based speech synthesis
Daniel Erro, Agustin Alonso, Luis Serrano, Eva Navas, Inma Hernaez
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models
Nobukatsu Hojo, Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama
An experimental comparison of multiple vocoder types
Qiong Hu, Korin Richmond, Junichi Yamagishi, Javier Latorre
Statistical model training technique for speech synthesis based on speaker class
Yusuke Ijima, Noboru Miyazaki, Hideyuki Mizuno
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit
Systematic database creation for expressive singing voice synthesis control
Martí Umbert, Jordi Bonada, Merlijn Blaauw
Expressive speech synthesis: synthesising ambiguity
Matthew P. Aylett, Blaise Potard, Christopher J. Pidcock
Interactional adequacy as a factor in the perception of synthesized speech
Timo Baumann, David Schlangen
A novel irregular voice model for HMM-based speech synthesis
Tamás Gábor Csapó, Géza Németh
Expression of speaker’s intentions through sentence-final particle/intonation combinations in Japanese conversational speech synthesis
Kazuhiko Iwata, Tetsunori Kobayashi
Unified numerical simulation of the physics of voice. The EUNISON project
Oriol Guasch, Sten Ternström, Marc Arnela, Francesc Alías
Mage - HMM-based speech synthesis reactively controlled by the articulators
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit
Reactive accent interpolation through an interactive map application
Maria Astrinaki, Junichi Yamagishi, Simon King, Nicolas d’Alessandro, Thierry Dutoit
Real-time control of expressive speech synthesis using kinect body tracking
Christophe Veaux, Maria Astrinaki, Keiichiro Oura, Robert A. J. Clark, Junichi Yamagishi
Parametric model for vocal effort interpolation with harmonics plus noise models
Àngel Calzada Defez, Joan Claudi Socoró Carrié, Robert A. J. Clark
Vietnamese HMM-based speech synthesis with prosody information
Anh-Tuan Dinh, Thanh-Son Phan, Tat-Thang Vu, Chi Mai Luong
Context labels based on "bunsetsu" for HMM-based speech synthesis of Japanese
Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu
Using adaptation to improve speech transcription alignment in noisy and reverberant environments
Yoshitaka Mamiya, Adriana Stan, Junichi Yamagishi, Peter Bell, Oliver Watts, Robert A. J. Clark, Simon King
Speech synthesis using a maximally decimated pseudo QMF bank for embedded devices
Nobuyuki Nishizawa, Tsuneo Kato
HMM-based sCost quality control for unit selection speech synthesis
Sathish Pammi, Marcela Charfuelan
Understanding factors in emotion perception
Lakshmi Saheer, Blaise Potard
Multilingual number transcription for text-to-speech conversion
Rubén San-Segundo, Juan Manuel Montero, Mircea Giurgiu, Ioana Muresan, Simon King
Noise-robust voice conversion based on spectral mapping on sparse space
Ryoichi Takashima, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki
Cross-variety speaker transformation in HSMM-based speech synthesis
Markus Toman, Michael Pucher, Dietmar Schabus
Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis
Markus Toman, Michael Pucher, Dietmar Schabus
Is intelligibility still the main problem? A review of perceptual quality dimensions of synthetic speech
Florian Hinterleitner, Christoph Norrenbrock, Sebastian Möller
Evaluation of contextual descriptors for HMM-based speech synthesis in French
Sébastien Le Maguer, Nelly Barbot, Olivier Boeffard
Towards speaking style transplantation in speech synthesis
Jaime Lorenzo-Trueba, Roberto Barra-Chicote, Junichi Yamagishi, Oliver Watts, Juan Manuel Montero
Investigating the shortcomings of HMM synthesis
Thomas Merritt, Simon King
Prosodic analysis of storytelling discourse modes and narrative situations oriented to text-to-speech synthesis
Raúl Montaño, Francesc Alías, Josep Ferrer
Objective evaluation measures for speaker-adaptive HMM-TTS systems
Ulpu Remes, Reima Karhila, Mikko Kurimo
Experiments with signal-driven symbolic prosody for statistical parametric speech synthesis
Fabio Tesser, Giacomo Sommavilla, Giulio Paci, Piero Cosi
Significance of word-terminal syllables for prediction of phrase breaks in text-to-speech systems for Indian languages
Anandaswarup Vadapalli, Peri Bhaskararao, Kishore Prahallad
The effect of age and native speaker status on synthetic speech intelligibility
Catherine Watson, Wei Liu, Bruce MacDonald
Exemplar-based voice conversion using non-negative spectrogram deconvolution
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li
SASSC: a standard Arabic single speaker corpus
Ibrahim Almosallam, Atheer Alkhalifa, Mansour Alghamdi, Mohamed Alkanhal, Ashraf Alkhairy
Prosodically modifying speech for unit selection speech synthesis databases
Ladan Golipour, Alistair Conkie, Ann Syrdal
Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
Heng Lu, Simon King, Oliver Watts
Is unit selection aware of audible artifacts?
Jindřich Matoušek, Daniel Tihelka, Milan Legát
Development of electrolarynx with hands-free prosody control
Kenji Matsui, Kenta Kimura, Yoshihisa Nakatoh, Yumiko O. Kato
A hybrid TTS between unit selection and HMM-based TTS under limited data conditions
Trung-Nghia Phung, Chi Mai Luong, Masato Akagi
Wavelets for intonation modeling in HMM speech synthesis
Antti Suni, Daniel Aalto, Tuomo Raitio, Paavo Alku, Martti Vainio
A common attribute based unified HTS framework for speech synthesis in Indian languages
B. Ramani, S. Lilly Christina, G. Anushiya Rachel, V. Sherlin Solomi, Mahesh Kumar Nandwana, Anusha Prakash, S. Aswin Shanmugam, Raghava Krishnan, S. Kishore Prahalad, K. Samudravijaya, P. Vijayalakshmi, T. Nagarajan, Hema A. Murthy
Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis
Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
Residual compensation based on articulatory feature-based phone clustering for hybrid Mandarin speech synthesis
Yi-Chin Huang, Chung-Hsien Wu, Shih-Lun Lin