Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.11632
Cited By
Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
31 July 2018
Hieu-Thi Luong
Junichi Yamagishi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems"
12 / 12 papers shown
Title
Multimodal speech synthesis architecture for unsupervised speaker adaptation
Hieu-Thi Luong
Junichi Yamagishi
45
10
0
20 Aug 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Zhiwen Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
251
830
0
12 Jun 2018
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
Tomoki Toda
35
25
0
30 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
35
139
0
06 Apr 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
RJ Skerry-Ryan
Eric Battenberg
Y. Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
R. Clark
Rif A. Saurous
54
554
0
24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Y. Xiao
Fei Ren
Ye Jia
Rif A. Saurous
64
826
0
23 Mar 2018
Linear networks based speaker adaptation for speech synthesis
Zhiying Huang
Heng Lu
Ming Lei
Zhijie Yan
33
14
0
05 Mar 2018
Fitting New Speakers Based on a Short Untranscribed Sample
Eliya Nachmani
Adam Polyak
Yaniv Taigman
Lior Wolf
43
84
0
20 Feb 2018
Neural Voice Cloning with a Few Samples
Sercan O. Arik
Jitong Chen
Kainan Peng
Ming-Yu Liu
Yanqi Zhou
58
387
0
14 Feb 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
77
2,697
0
16 Dec 2017
Embedding-Based Speaker Adaptive Training of Deep Neural Networks
Xiaodong Cui
Vaibhava Goel
G. Saon
33
40
0
17 Oct 2017
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
383
7,389
0
12 Sep 2016
1