Learning latent representations for style control and transfer in
end-to-end speech synthesis

Learning latent representations for style control and transfer in end-to-end speech synthesis

11 December 2018

Shifeng Pan

Papers citing "Learning latent representations for style control and transfer in end-to-end speech synthesis"

14 / 14 papers shown

Title
Decision-Making with Auto-Encoding Variational Bayes Romain Lopez Pierre Boyeau Nir Yosef Michael I. Jordan Jeffrey Regier BDL 177 10,591 0 17 Feb 2020
Neural Speech Synthesis with Transformer Network Naihan Li Shujie Liu Yanqing Liu Sheng Zhao Ming-Yuan Liu M. Zhou 31 102 0 19 Sep 2018
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis Daisy Stanton Yuxuan Wang RJ Skerry-Ryan 36 122 0 04 Aug 2018
Understanding disentangling in $β$ -VAE Christopher P. Burgess I. Higgins Arka Pal Loic Matthey Nicholas Watters Guillaume Desjardins Alexander Lerchner CoGe DRL 42 828 0 10 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder K. Akuzawa Yusuke Iwasawa Y. Matsuo 25 139 0 06 Apr 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan Eric Battenberg Y. Xiao Yuxuan Wang Daisy Stanton Joel Shor Ron J. Weiss R. Clark Rif A. Saurous 40 550 0 24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Yuxuan Wang Daisy Stanton Yu Zhang RJ Skerry-Ryan Eric Battenberg Joel Shor Y. Xiao Fei Ren Ye Jia Rif A. Saurous 57 822 0 23 Mar 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions Jonathan Shen Ruoming Pang Ron J. Weiss M. Schuster Navdeep Jaitly ... Yuxuan Wang RJ Skerry-Ryan Rif A. Saurous Yannis Agiomyrgiannakis Yonghui Wu 61 2,684 0 16 Dec 2017
Learning Latent Representations for Speech Generation and Transformation Wei-Ning Hsu Yu Zhang James R. Glass DRL BDL SSL 40 145 0 13 Apr 2017
WaveNet: A Generative Model for Raw Audio Aaron van den Oord Sander Dieleman Heiga Zen Karen Simonyan Oriol Vinyals Alex Graves Nal Kalchbrenner A. Senior Koray Kavukcuoglu DiffM 270 7,361 0 12 Sep 2016
Tutorial on Variational Autoencoders Carl Doersch BDL DRL 74 1,736 0 19 Jun 2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations David M. Krueger Tegan Maharaj János Kramár Mohammad Pezeshki Nicolas Ballas Nan Rosemary Ke Anirudh Goyal Yoshua Bengio Aaron Courville C. Pal 51 317 0 03 Jun 2016
Generating Sentences from a Continuous Space Samuel R. Bowman Luke Vilnis Oriol Vinyals Andrew M. Dai Rafal Jozefowicz Samy Bengio DRL 74 2,352 0 19 Nov 2015
Attention-Based Models for Speech Recognition J. Chorowski Dzmitry Bahdanau Dmitriy Serdyuk Kyunghyun Cho Yoshua Bengio 95 2,602 0 24 Jun 2015