A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach

5 July 2019
Noé Tits
arXiv: 1907.02784

Papers citing "A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach"

15 / 15 papers shown
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen
Viet Dang
R. Clark
Yu Zhang
Ron J. Weiss
Ye Jia
Zhiwen Chen
Yonghui Wu
104
959
0
05 Apr 2019
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis
Noé Tits
Fengna Wang
Kevin El Haddad
Vincent Pagel
Thierry Dutoit
DiffM
67
39
0
27 Mar 2019
Exploring Transfer Learning for Low Resource Emotional TTS
Noé Tits
Kevin El Haddad
Thierry Dutoit
61
61
0
14 Jan 2019
Hierarchical Generative Modeling for Controllable Speech Synthesis
Wei-Ning Hsu
Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
...
Ye Jia
Zhiwen Chen
Jonathan Shen
Patrick Nguyen
Ruoming Pang
BDL
75
276
0
16 Oct 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis
G. Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
DRL
SSL
65
61
0
30 Jul 2018
The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems
Adaeze Adigwe
Noé Tits
Kevin El Haddad
Sarah Ostadabbas
Thierry Dutoit
58
80
0
25 Jun 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach
Noé Tits
Kevin El Haddad
Thierry Dutoit
43
28
0
23 May 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
60
139
0
06 Apr 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
RJ Skerry-Ryan
Eric Battenberg
Y. Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
R. Clark
Rif A. Saurous
56
555
0
24 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Y. Xiao
Fei Ren
Ye Jia
Rif A. Saurous
66
827
0
23 Mar 2018
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
94
870
0
23 Feb 2018
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
Hideyuki Tachibana
Katsuya Uenoyama
Shunsuke Aihara
58
266
0
24 Oct 2017
Tacotron: Towards End-to-End Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
...
Samy Bengio
Quoc V. Le
Yannis Agiomyrgiannakis
R. Clark
Rif A. Saurous
166
1,831
0
29 Mar 2017
Deep Voice: Real-time Neural Text-to-Speech
Sercan O. Arik
Mike Chrzanowski
Adam Coates
G. Diamos
Andrew Gibiansky
...
John Miller
Andrew Ng
Jonathan Raiman
Shubho Sengupta
Mohammad Shoeybi
97
617
0
25 Feb 2017
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,421
0
12 Sep 2016