A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach

5 July 2019

Papers citing "A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach"

15 / 15 papers shown

Title
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech Heiga Zen Viet Dang R. Clark Yu Zhang Ron J. Weiss Ye Jia Zhiwen Chen Yonghui Wu 104 959 0 05 Apr 2019
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis Noé Tits Fengna Wang Kevin El Haddad Vincent Pagel Thierry Dutoit DiffM 67 39 0 27 Mar 2019
Exploring Transfer Learning for Low Resource Emotional TTS Noé Tits Kevin El Haddad Thierry Dutoit 61 61 0 14 Jan 2019
Hierarchical Generative Modeling for Controllable Speech Synthesis Wei-Ning Hsu Yu Zhang Ron J. Weiss Heiga Zen Yonghui Wu ... Ye Jia Zhiwen Chen Jonathan Shen Patrick Nguyen Ruoming Pang BDL 75 276 0 16 Oct 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis G. Henter Jaime Lorenzo-Trueba Xin Wang Junichi Yamagishi DRL SSL 65 61 0 30 Jul 2018
The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems Adaeze Adigwe Noé Tits Kevin El Haddad Sarah Ostadabbas Thierry Dutoit 58 80 0 25 Jun 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach Noé Tits Kevin El Haddad Thierry Dutoit 43 28 0 23 May 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder K. Akuzawa Yusuke Iwasawa Y. Matsuo 60 139 0 06 Apr 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan Eric Battenberg Y. Xiao Yuxuan Wang Daisy Stanton Joel Shor Ron J. Weiss R. Clark Rif A. Saurous 56 555 0 24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Yuxuan Wang Daisy Stanton Yu Zhang RJ Skerry-Ryan Eric Battenberg Joel Shor Y. Xiao Fei Ren Ye Jia Rif A. Saurous 66 827 0 23 Mar 2018
Efficient Neural Audio Synthesis Nal Kalchbrenner Erich Elsen Karen Simonyan Seb Noury Norman Casagrande Edward Lockhart Florian Stimberg Aaron van den Oord Sander Dieleman Koray Kavukcuoglu 94 870 0 23 Feb 2018
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention Hideyuki Tachibana Katsuya Uenoyama Shunsuke Aihara 58 266 0 24 Oct 2017
Tacotron: Towards End-to-End Speech Synthesis Yuxuan Wang RJ Skerry-Ryan Daisy Stanton Yonghui Wu Ron J. Weiss ... Samy Bengio Quoc V. Le Yannis Agiomyrgiannakis R. Clark Rif A. Saurous 166 1,831 0 29 Mar 2017
Deep Voice: Real-time Neural Text-to-Speech Sercan O. Arik Mike Chrzanowski Adam Coates G. Diamos Andrew Gibiansky ... John Miller Andrew Ng Jonathan Raiman Shubho Sengupta Mohammad Shoeybi 97 617 0 25 Feb 2017
WaveNet: A Generative Model for Raw Audio Aaron van den Oord Sander Dieleman Heiga Zen Karen Simonyan Oriol Vinyals Alex Graves Nal Kalchbrenner A. Senior Koray Kavukcuoglu DiffM 406 7,421 0 12 Sep 2016