Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems

31 July 2018

Papers citing "Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems"

12 / 12 papers shown

Title
Multimodal speech synthesis architecture for unsupervised speaker adaptation Hieu-Thi Luong Junichi Yamagishi 45 10 0 20 Aug 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Ye Jia Yu Zhang Ron J. Weiss Quan Wang Jonathan Shen ... Zhiwen Chen Patrick Nguyen Ruoming Pang Ignacio López Moreno Yonghui Wu 251 830 0 12 Jun 2018
Collapsed speech segment detection and suppression for WaveNet vocoder Yi-Chiao Wu Kazuhiro Kobayashi Tomoki Hayashi Patrick Lumban Tobing Tomoki Toda 35 25 0 30 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder K. Akuzawa Yusuke Iwasawa Y. Matsuo 35 139 0 06 Apr 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan Eric Battenberg Y. Xiao Yuxuan Wang Daisy Stanton Joel Shor Ron J. Weiss R. Clark Rif A. Saurous 54 554 0 24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Yuxuan Wang Daisy Stanton Yu Zhang RJ Skerry-Ryan Eric Battenberg Joel Shor Y. Xiao Fei Ren Ye Jia Rif A. Saurous 64 826 0 23 Mar 2018
Linear networks based speaker adaptation for speech synthesis Zhiying Huang Heng Lu Ming Lei Zhijie Yan 33 14 0 05 Mar 2018
Fitting New Speakers Based on a Short Untranscribed Sample Eliya Nachmani Adam Polyak Yaniv Taigman Lior Wolf 43 84 0 20 Feb 2018
Neural Voice Cloning with a Few Samples Sercan O. Arik Jitong Chen Kainan Peng Ming-Yu Liu Yanqi Zhou 58 387 0 14 Feb 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions Jonathan Shen Ruoming Pang Ron J. Weiss M. Schuster Navdeep Jaitly ... Yuxuan Wang RJ Skerry-Ryan Rif A. Saurous Yannis Agiomyrgiannakis Yonghui Wu 77 2,697 0 16 Dec 2017
Embedding-Based Speaker Adaptive Training of Deep Neural Networks Xiaodong Cui Vaibhava Goel G. Saon 33 40 0 17 Oct 2017
WaveNet: A Generative Model for Raw Audio Aaron van den Oord Sander Dieleman Heiga Zen Karen Simonyan Oriol Vinyals Alex Graves Nal Kalchbrenner A. Senior Koray Kavukcuoglu DiffM 383 7,389 0 12 Sep 2016