GMM-PCA based speaker-timbre conversion on full-quality speech
Fernando Villavicencio, Esteban Maestre
Voice conversion using precise speech alignment based on spectral property and eigen-codeword distribution
Yi-Chin Huang, Chung-Hsien Wu, Chung-Han Lee, Yu-Ting Chao
On transforming spectral peaks in voice conversion
Elizabeth Godoy, Olivier Rosec, Thierry Chonavel
Linear transformation approaches to many-to-one voice conversion
Chie Hayashida, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano
HMM-based robust voice conversion using adaptive F0 quantization
Takashi Nose, Takao Kobayashi
Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters
Ranniery Maia, Heiga Zen, M. J. F. Gales
From discontinuous to continuous F0 modelling in HMM-based speech synthesis
Kai Yu, Blaise Thomson, Steve Young
Spectral modeling with contextual additive structure for HMM-based speech synthesis
Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda
Bayesian speech synthesis framework integrating training and synthesis processes
Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda
Symbolic vs. acoustics-based style control for expressive unit selection
Ingmar Steiner, Marc Schröder, Marcela Charfuelan, Annette Klepp
Application of expressive TTS synthesis in an advanced ECA system
Jan Romportl, Enrico Zovato, Raúl Santos, Pavel Ircing, José Relaño Gil, Morena Danieli
A hidden Markov model-based approach for emotional speech synthesis
Chih-Yung Yang, Chia-Ping Chen
Two vocoder techniques for neutral to emotional timbre conversion
Fabio Tesser, Enrico Zovato, Mauro Nicolao, Piero Cosi
Evaluating speech synthesis intelligibility using Amazon Mechanical Turk
Maria K. Wolters, Karl B. Isaac, Steve Renals
Further exploration of the possibilities and pitfalls of multidimensional scaling as a tool for the evaluation of the quality of synthesized speech
Anna C. Janska, Robert A. J. Clark
Handling large audio files in audio books for building synthetic voices
Kishore Prahallad, Alan W. Black
Improving speech synthesis for noisy environments
Gopala Krishna Anumanchipalli, Prasanna Kumar Muthukumar, Udhyakumar Nallasamy, Alok Parlikar, Alan W. Black, Brian Langner
Learning speaker-specific phrase breaks for text-to-speech systems
Kishore Prahallad, E. Veera Raghavendra, Alan W. Black
Substitution of state distributions to reproduce natural prosody on HMM-based speech synthesizers
Nobuyuki Nishizawa, Tsuneo Kato
Utilising spontaneous conversational speech in HMM-based speech synthesis
Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark
Speech acts and dialog TTS
Ann K. Syrdal, Alistair Conkie, Yeon-Jun Kim, Mark C. Beutnagel
HMM-based polyglot speech synthesis by speaker and language adaptive training
Heiga Zen, Norbert Braunschweiler, Sabine Buchholz, Kate Knill, Sacha Krstulovic, Javier Latorre
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project
Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Junichi Yamagishi
Toward naturally expressive speech synthesis: data-driven emotion detection using latent affective analysis
Jerome R. Bellegarda
KLATTSTAT: knowledge-based parametric speech synthesis
Gopala Krishna Anumanchipalli, Ying-Chang Cheng, Joseph Fernandez, Xiaohan Huang, Qi Mao, Alan W. Black
Recent development of the HMM-based singing voice synthesis system — Sinsy
Keiichiro Oura, Ayami Mase, Tomohiko Yamada, Satoru Muto, Yoshihiko Nankaku, Keiichi Tokuda
Photo-real lips synthesis with trajectory-guided sample selection
Lijuan Wang, Xiaojun Qian, Wei Han, Frank K. Soong
Implementation of VTLN for statistical speech synthesis
Lakshmi Saheer, John Dines, Philip N. Garner, Hui Liang
Do prosodic cues influence uncertainty perception in articulatory speech synthesis?
Eva Lasarcyk, Charlotte Wollermann
An unified and automatic approach of Mandarin HTS system
Yong Guan, Jilei Tian, Yi-Jian Wu, Junichi Yamagishi, Jani Nurminen
Synthesis of listener vocalisations with imposed intonation contours
Sathish Pammi, Marc Schröder, Marcela Charfuelan, Oytun Türk, Ingmar Steiner
An investigation of the impact of speech transcript errors on HMM voices
Jinfu Ni, Hisashi Kawai
An HMM-based singing style modeling system for singing voice synthesizers
Keijiro Saino, Makoto Tachibana, Hideki Kenmochi
Lombard effect mimicking
Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong
Unsupervised prosody labeling for constructing Mandarin TTS
Chen Yu Chiang, Sin-Horng Chen, Yih-Ru Wang
Analysis and synthesis of hypo- and hyperarticulated speech
Benjamin Picart, Thomas Drugman, Thierry Dutoit
Evaluating prosody in synthetic speech with online (eye-tracking) and offline (rating) methods
Rajakrishnan Rajkumar, Michael White, Shari R. Speer, Kiwako Ito
Refined statistical model tuning for speech synthesis
Xu Shao, Vincent Pollet, Andrew Breen
High quality TTS voices within one day
Didier Cadic, Christophe d'Alessandro
Nativization of English words in Spanish using analogy
Tatyana Polyákova, Antonio Bonafonte
Automatic prosodic labeling of accent information for Japanese spoken sentences
Asami Yamamoto, Kazuhiro Suzuki, Kook Cho, Yoichi Yamashita
An automatic pitch model with distance function
Mohamed Abou-Zleikha, Peter Cahill, Julie Carson-Berndsen
Considering readability in text-to-speech recording script design
Minghui Dong, Ling Cen, Paul Chan, Haizhou Li
Letter-based speech synthesis
Oliver Watts, Junichi Yamagishi, Simon King
Joint prosodic and segmental unit selection for expressive speech synthesis
Christophe Veaux, Pierre Lanchantin, Xavier Rodet
Speech synthesis in the mobile user interface
Pieter E. Scholtz, Justus C. Roux, Jacques P. du Toit
Comparison of formant enhancement methods for HMM-based speech synthesis
Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku
EM-HTS: real-time HMM-based Malay emotional speech synthesis
Mumtaz B. Mustafa, Raja N. Ainon, Roziati Zainuddin
High level emotional speech morphing using STRAIGHT
Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong
Adding speaking style to a TTS system
Jean-Philippe Goldman, Sophie Roekhaut, Anne Catherine Simon
Synthesizing fast speech by implementing multi-phone units in unit selection speech synthesis
Donata Moers, Igor Jauk, Bernd Möbius, Petra Wagner
Improved generation of prosodic features in HMM-based Mandarin speech synthesis
Miaomiao Wang, Miaomiao Wen, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu
An HMM-based speech synthesiser using glottal post-filtering
João P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi
A study of lexical stress patterns in unit selection synthesis
Yeon-Jun Kim, Mark C. Beutnagel
Automatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis
Andreas Windmann, Petra Wagner, Fabio Tamburini, Denis Arnold, Catharine Oertel