Learning optimal audiovisual phasing for an HMM-based control model for facial animation
Oxana Govokhina, Gérard Bailly, Gaspard Breton
Control concepts for articulatory speech synthesis
Peter Birkholz, Ingmar Steiner, Stefan Breuer
Spectral control in concatenative speech synthesis
Alexander B. Kain, Qi Miao, Jan P. H. van Santen
Feature transformation applied to the detection of discontinuities in concatenated speech
Barry Kirkpatrick, Darragh O'Brien, Ronán Scaife
Towards conversational speech synthesis; lessons learned from the expressive speech processing project
Nick Campbell
Communicative speech synthesis with XIMERA: a first step
Shinsuke Sakai, Jinfu Ni, Ranniery Maia, Keiichi Tokuda, Minoru Tsuzaki, Tomoki Toda, Hisashi Kawai, Satoshi Nakamura
Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis
Raul Fernandez, Bhuvana Ramabhadran
Modeling and perceiving of (un-)certainty in articulatory speech synthesis
Charlotte Wollermann, Eva Lasarcyk
Perceptual annotation of expressive speech
Lijuan Wang, Min Chu, Yaya Peng, Yong Zhao, Frank K. Soong
Joint analysis of speech frames for synthesis based on lossy tube models
Karl Schnell, Arild Lacroix
Are rule-based syllabification methods adequate for languages with low syllabic complexity? the case of Italian
Connie R. Adsett, Yannick Marchand
Spoken language conversion with accent morphing
Mark Huckvale, Kayoko Yanagisawa
Comparative investigation of peak alignment in Polish and German unit selection corpora
Grazyna Demenko, Agnieszka Wagner, Matthias Jilka, Bernd Möbius
Optimization of Polish segmental duration prediction with CART
Katarzyna Klessa, Marcin Szymanski, Stefan Breuer, Grazyna Demenko
Utilization of an HMM-based feature generation module in 5 ms segment concatenative speech synthesis
Toshio Hirai, Junichi Yamagishi, Seiichi Tenpaku
Clustering algorithm for F0 curves based on hidden Markov models
Damien Lolive, Nelly Barbot, Olivier Boeffard
Building a better Indian English voice using "more data"
Rohit Kumar, Rashmi Gangadharaiah, Sharath Rao, Kishore Prahallad, Carolyn P. Rosé, Alan W. Black
Creating German unit selection voices for the MARY TTS platform from the BITS corpora
Marc Schröder, Anna Hunecke
Regression approaches to voice quality controll based on one-to-many eigenvoice conversion
Kumi Ohta, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets
Daisuke Tani, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
Towards an improved modeling of the glottal source in statistical parametric speech synthesis
Joao P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi
GMM-based speech transformation systems under data reduction
Larbi Mesbahi, Vincent Barreaud, Olivier Boeffard
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV
Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, Keiichi Tokuda
An excitation model for HMM-based speech synthesis based on residual modeling
Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
An HMM-based bilingual (Mandarin-English) TTS
Hui Liang, Yao Qian, Frank K. Soong
Data-driven approach to rapid prototyping Xhosa speech synthesis
Justus C. Roux, Albert S. Visagie
CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems
Nobuaki Minematsu, Ryo Kuroiwa, Keikichi Hirose, Michiko Watanabe
Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models
Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu
Design of tree-based context clustering for an HMM-based Thai speech synthesis system
Suphattharachai Chomphan, Takao Kobayashi
Development of a BOSS unit selection module for tone languages
Arne Bachmann, Stefan Breuer
Unit-selection text-to-speech synthesis using an asynchronous interpolation model
Alexander B. Kain, Jan P. H. van Santen
Modelling voiceless speech segments by means of an additive procedure based on the computation of formant sinusoids
Ingo Hertrich, Hermann Ackermann
Using articulatory position data in voice transformation
Arthur R. Toth, Alan W. Black
Text processing for text-to-speech systems in Indian languages
Anand Arokia Raj, Tanuja Sarkar, Satish Chandra Pammi, Santhosh Yuvaraj, Mohit Bansal, Kishore Prahallad, Alan W. Black
Flexible harmonic/stochastic speech synthesis
Daniel Erro, Asunción Moreno, Antonio Bonafonte
Prosody modelling in Czech text-to-speech synthesis
Jan Romportl, Jirí Kala
Measuring attribute dissimilarity with HMM KL-divergence for speech synthesis
Yong Zhao, Chengsuo Zhang, Frank K. Soong, Min Chu, Xi Xiao
Lagrangian relaxation for optimal corpus design
Jonathan Chevelu, Nelly Barbot, Olivier Boeffard, Arnaud Delhay
Adaptive database reduction for domain specific speech synthesis
Aleksandra Krul, Géraldine Damnati, François Yvon, Cédric Boidin, Thierry Moudenc
Statistical analysis of filled pauses² rhythm for disfluent speech synthesis
Jordi Adell, Antonio Bonafonte, David Escudero
Quantitative analysis of F0 contours of emotional speech of Mandarin
Wentao Gu, Tan Lee
Maximum-likelihood dynamic intonation model for concatenative text-to-speech system
Slava Shechtman
Data-driven extraction of intonation contour classes
Uwe D. Reichel
Word accentuation prediction using a neural net classifier
Taniya Mishra, Emily Tucker Prud'hommeaux, Jan P. H. van Santen
Issues of optionality in pitch accent placement
Leonardo Badino, Robert A. J. Clark
Single speaker segmentation and inventory selection using dynamic time warping self organization and joint multigram mapping
Matthew P. Aylett, Simon King
How (not) to select your voice corpus: random selection vs. phonologically balanced
Tanya Lambert, Norbert Braunschweiler, Sabine Buchholz
Unit selection synthesis using long non-uniform units and phonemic identity matching
Lukas Latacz, Yuk On Kong, Werner Verhelst
Evaluation of various unit types in the unit selection approach for the Czech language using the Festival system
Martin Gruber, Daniel Tihelka, Jindrich Matousek
Assessing the adequate treatment of fast speech in unit selection speech synthesis systems for the visually impaired
Donata Moers, Petra Wagner, Stefan Breuer
Making speech synthesis more accessible to older people
Maria Wolters, Pauline Campbell, Christine DePlacido, Amy Liddell, David Owens
The HMM-based speech synthesis system (HTS) version 2.0
Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako, Takashi Masuko, Alan W. Black, Keiichi Tokuda
eCIRCUS: building voices for autonomous speaking agents
Christian Weiss, Luis C. Oliveira, Sergio Paulo, Carlos Mendes, Luis Figueira, Marco Vala, Pedro Sequeira, Ana Paiva, Thurid Vogt, Elisabeth Andre
Unit selection synthesis in the Smartweb project
Martin Barbisch, Grzegorz Dogil, Bernd Möbius, Bettina Säuberlich, Antje Schweitzer
Building a Finnish unit selection TTS system
Hanna Silen, Elina Helander, Konsta Koppinen, Moncef Gabbouj
Evaluating automatic syllabification algorithms for English
Yannick Marchand, Connie R. Adsett, Robert I. Damper
Voice building from insufficient data - classroom experiences with web-based language development tools
John Kominek, Tanja Schultz, Alan W. Black
SVM based feature extraction in speech synthesis
Peter Cahill, Jan Macek, Julie Carson-Berndsen
Spectral conversion based on statistical models including time-sequence matching
Yoshihiko Nankaku, Kenichi Nakamura, Tomoki Toda, Keiichi Tokuda
Analysis of affective speech recordings using the superpositional intonation model
Esther Klabbers, Taniya Mishra, Jan P. H. van Santen
Calliphony: a real-time intonation controller for expressive speech synthesis
Sylvain Le Beux, Albert Rilliard, Christophe d'Alessandro
Epoch synchronous non-overlap-add (ESNOLA) method-based concatenative speech synthesis system for Bangla
Shyamal Kumar Das Mandal, Asoke Kumar Datta
Syllable-based Thai duration model using multi-level linear regression and syllable accommodation
Chatchawarn Hansakunbuntheung, Hiroaki Kato, Yoshinori Sagisaka
Linguistic and mixed excitation improvements on a HMM-based speech synthesis for Castilian Spanish
Xavier Gonzalvo, Joan Claudi Socoró, Ignasi Iriondo, Carlos Monzo, Elisa Martínez
Inventory of intonation contours for text-to-speech synthesis
Tetyana Lyudovyk, Valentyna Robeiko
Analysis methods for assessing TTS intelligibility
H. Timothy Bunnell, Jason Lilley
Understandable production of massive synthesis
Brian Langner, Alan W. Black
The online evaluation of speech synthesis using eye movements
Charlotte van Hooijdonk, Edwin Commandeur, Reinier Cozijn, Emiel Krahmer, Erwin Marsi
Article |
---|