ISCA Archive SSW 2019 Sessions Search Website Booklet
  ISCA Archive Sessions Search Website Booklet
×

Click on column names to sort.

Searching uses the 'and' of terms e.g. Smith Interspeech matches all papers by Smith in any Interspeech. The order of terms is not significant.

Use double quotes for exact phrasal matches e.g. "acoustic features".

Case is ignored.

Diacritics are optional e.g. lefevre also matches lefèvre (but not vice versa).

It can be useful to turn off spell-checking for the search box in your browser preferences.

If you prefer to scroll rather than page, increase the number in the show entries dropdown.

top

10th ISCA Workshop on Speech Synthesis

Vienna, Austria
20-22 September 2019

Chair: Michael Pucher
doi: 10.21437/SSW.2019

keynote 1: Deep learning for speech synthesis - Aäron van den Oord


Deep learning for speech synthesis
Aäron van den Oord




poster 1: Voice conversion and multi-speaker TTS


Multi-Speaker Modeling for DNN-based Speech Synthesis Incorporating Generative Adversarial Networks
Hiroki Kanagawa, Yusuke Ijima

Speaker Adaptation of Acoustic Model using a Few Utterances in DNN-based Speech Synthesis Systems
Ivan Himawan, Sandesh Aryal, Iris Ouyang, Shukhan Ng, Pierre Lanchantin

DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis
Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari

Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion
Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda

Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder
Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda

Voice Conversion without Explicit Separation of Source and Filter Components Based on Non-negative Matrix Factorization
Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu

Voice conversion based on full-covariance mixture density networks for time-variant linear transformations
Gaku Kotani, Daisuke Saito

Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion
Tobias Gburrek, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, Petra Wagner

Novel Inception-GAN for Whispered-to-Normal Speech Conversion
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh Shah, Hemant Patil

Implementation of DNN-based real-time voice conversion and its improvements by audio data augmentation and mask-shaped device
Riku Arakawa, Shinnosuke Takamichi, Hiroshi Saruwatari


keynote 2: Synthesizing animal vocalizations and modelling animal speech - Tecumseh Fitch and Bart de Boer


Synthesizing animal vocalizations and modelling animal speech
Tecumseh Fitch, Bart de Boer





keynote 3: Natural Language Generation: Creating Text - Claire Gardent


Natural Language Generation: Creating Text
Claire Gardent





Search papers
Article
×

keynote 1: Deep learning for speech synthesis - Aäron van den Oord

oral 1: Neural vocoder

oral 2: Adaptation

poster 1: Voice conversion and multi-speaker TTS

keynote 2: Synthesizing animal vocalizations and modelling animal speech - Tecumseh Fitch and Bart de Boer

oral 3: Evaluation and performance

oral 4: Speech science

poster 2: Applications and practical issues

keynote 3: Natural Language Generation: Creating Text - Claire Gardent

oral 5: Language and dialect varieties

oral 6: Sequence to sequence model

poster 3: Prosody