ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio
v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
J. Valin
Jan Skoglund
62
79
0
28 Mar 2019
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Kyubyong Park
Thomas Mulc
83
101
0
27 Mar 2019
WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the
  Wasserstein-GAN
WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN
Pritish Chandna
Merlijn Blaauw
J. Bonada
E. Gómez
114
62
0
26 Mar 2019
General Probabilistic Surface Optimization and Log Density Estimation
General Probabilistic Surface Optimization and Log Density Estimation
Dmitry Kopitkov
Vadim Indelman
87
1
0
25 Mar 2019
Bandwidth Extension on Raw Audio via Generative Adversarial Networks
Bandwidth Extension on Raw Audio via Generative Adversarial Networks
S. Kim
V. Sathe
GAN
58
26
0
21 Mar 2019
Smart Edition of MIDI Files
Smart Edition of MIDI Files
Pierre Roy
F. Pachet
28
1
0
20 Mar 2019
Neural Networks for Lorenz Map Prediction: A Trip Through Time
Neural Networks for Lorenz Map Prediction: A Trip Through Time
Denisa Roberts
AI4TSAI4CE
21
1
0
18 Mar 2019
A Vocoder Based Method For Singing Voice Extraction
A Vocoder Based Method For Singing Voice Extraction
Pritish Chandna
Merlijn Blaauw
J. Bonada
E. Gómez
43
9
0
18 Mar 2019
Bilinear Representation for Language-based Image Editing Using
  Conditional Generative Adversarial Networks
Bilinear Representation for Language-based Image Editing Using Conditional Generative Adversarial Networks
Xiaofeng Mao
YueFeng Chen
Yuhong Li
T. Xiong
Yuan He
Hui Xue
GAN
81
21
0
18 Mar 2019
Counterpoint by Convolution
Counterpoint by Convolution
Cheng-Zhi Anna Huang
Tim Cooijmans
Adam Roberts
Aaron Courville
Douglas Eck
BDL
81
151
0
18 Mar 2019
Generative adversarial network-based glottal waveform model for
  statistical parametric speech synthesis
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis
Bajibabu Bollepalli
Lauri Juvela
P. Alku
63
46
0
14 Mar 2019
Voice command generation using Progressive Wavegans
Voice command generation using Progressive Wavegans
Thomas Wiest
N. Cummins
Alice Baird
Simone Hantke
J. Dineley
Björn Schuller
GAN
35
1
0
13 Mar 2019
Deep Text-to-Speech System with Seq2Seq Model
Deep Text-to-Speech System with Seq2Seq Model
Gary Wang
AI4TS
41
9
0
11 Mar 2019
Scaling up deep neural networks: a capacity allocation perspective
Scaling up deep neural networks: a capacity allocation perspective
Jonathan Donier
46
0
0
11 Mar 2019
Accelerating Minibatch Stochastic Gradient Descent using Typicality
  Sampling
Accelerating Minibatch Stochastic Gradient Descent using Typicality Sampling
Xinyu Peng
Li Li
Feiyue Wang
BDL
140
59
0
11 Mar 2019
Singing voice conversion with non-parallel data
Singing voice conversion with non-parallel data
Xin Chen
Wei Chu
Jinxi Guo
N. Xu
39
28
0
11 Mar 2019
A Deep Generative Model of Speech Complex Spectrograms
A Deep Generative Model of Speech Complex Spectrograms
Aditya Arie Nugraha
Kouhei Sekiguchi
Kazuyoshi Yoshii
47
19
0
08 Mar 2019
A Character-Level Approach to the Text Normalization Problem Based on a
  New Causal Encoder
A Character-Level Approach to the Text Normalization Problem Based on a New Causal Encoder
Adrián Javaloy Bornás
G. García-Mateos
CML
11
3
0
06 Mar 2019
Autoregressive Convolutional Recurrent Neural Network for Univariate and
  Multivariate Time Series Prediction
Autoregressive Convolutional Recurrent Neural Network for Univariate and Multivariate Time Series Prediction
Matteo Maggiolo
Gerasimos Spanakis
AI4TSBDL
62
9
0
06 Mar 2019
High-Fidelity Image Generation With Fewer Labels
High-Fidelity Image Generation With Fewer Labels
Mario Lucic
Michael Tschannen
Marvin Ritter
Xiaohua Zhai
Olivier Bachem
Sylvain Gelly
GANOOD
130
159
0
06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action
  Segmentation
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
Yazan Abu Farha
Juergen Gall
90
671
0
05 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video
  Generation
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
98
132
0
04 Mar 2019
Analysing Deep Learning-Spectral Envelope Prediction Methods for Singing
  Synthesis
Analysing Deep Learning-Spectral Envelope Prediction Methods for Singing Synthesis
F. Bous
A. Röbel
34
3
0
04 Mar 2019
Equilibrated Recurrent Neural Network: Neuronal Time-Delayed
  Self-Feedback Improves Accuracy and Stability
Equilibrated Recurrent Neural Network: Neuronal Time-Delayed Self-Feedback Improves Accuracy and Stability
Ziming Zhang
Anil Kag
Alan Sullivan
Venkatesh Saligrama
47
5
0
02 Mar 2019
Fine-Grained Semantic Segmentation of Motion Capture Data using Dilated
  Temporal Fully-Convolutional Networks
Fine-Grained Semantic Segmentation of Motion Capture Data using Dilated Temporal Fully-Convolutional Networks
N. Cheema
S. Hosseini
J. Sprenger
E. Herrmann
H. Du
K. Fischer
P. Slusallek
18
3
0
02 Mar 2019
1D Convolutional Neural Network Models for Sleep Arousal Detection
1D Convolutional Neural Network Models for Sleep Arousal Detection
M. Zabihi
Ali Bahrami Rad
S. Kiranyaz
Simo Särkkä
Moncef Gabbouj
47
14
0
01 Mar 2019
A Unified Neural Architecture for Instrumental Audio Tasks
A Unified Neural Architecture for Instrumental Audio Tasks
Steven Spratley
Daniel Beck
Trevor Cohn
56
5
0
01 Mar 2019
Assume, Augment and Learn: Unsupervised Few-Shot Meta-Learning via
  Random Labels and Data Augmentation
Assume, Augment and Learn: Unsupervised Few-Shot Meta-Learning via Random Labels and Data Augmentation
Antreas Antoniou
Amos Storkey
SSL
113
75
0
26 Feb 2019
The State of Sparsity in Deep Neural Networks
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
193
765
0
25 Feb 2019
Wasserstein-Wasserstein Auto-Encoders
Wasserstein-Wasserstein Auto-Encoders
Shunkang Zhang
Yuan Gao
Yuling Jiao
Jin Liu
Yang Wang
Can Yang
DRLDiffM
29
13
0
25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification
Attentional Encoder Network for Targeted Sentiment Classification
Youwei Song
Jiahai Wang
Tao Jiang
Zhiyue Liu
Yanghui Rao
72
278
0
25 Feb 2019
GANSynth: Adversarial Neural Audio Synthesis
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
111
393
0
23 Feb 2019
Towards Neural Mixture Recommender for Long Range Dependent User
  Sequences
Towards Neural Mixture Recommender for Long Range Dependent User Sequences
Jiaxi Tang
Francois Belletti
Sagar Jain
Minmin Chen
Alex Beutel
Can Xu
Ed H. Chi
64
92
0
22 Feb 2019
Capacity allocation through neural network layers
Capacity allocation through neural network layers
Jonathan Donier
48
3
0
22 Feb 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question
  Answering
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
Ramakrishna Vedantam
Karan Desai
Stefan Lee
Marcus Rohrbach
Dhruv Batra
Devi Parikh
NAIBDL
97
87
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
80
51
0
20 Feb 2019
Data Efficient Voice Cloning for Neural Singing Synthesis
Data Efficient Voice Cloning for Neural Singing Synthesis
Merlijn Blaauw
J. Bonada
R. Daido
137
33
0
19 Feb 2019
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks
Hafiz Malik
53
26
0
18 Feb 2019
STCN: Stochastic Temporal Convolutional Networks
STCN: Stochastic Temporal Convolutional Networks
Emre Aksan
Otmar Hilliges
BDL
59
62
0
18 Feb 2019
Learning to Adaptively Scale Recurrent Neural Networks
Learning to Adaptively Scale Recurrent Neural Networks
Hao Hu
Liqiang Wang
Guo-Jun Qi
AI4CE
42
10
0
15 Feb 2019
Fully Convolutional Networks for Text Classification
Fully Convolutional Networks for Text Classification
Jacob Anderson
16
4
0
14 Feb 2019
Toward Ergonomic Risk Prediction via Segmentation of Indoor Object
  Manipulation Actions Using Spatiotemporal Convolutional Networks
Toward Ergonomic Risk Prediction via Segmentation of Indoor Object Manipulation Actions Using Spatiotemporal Convolutional Networks
Behnoosh Parsa
Ekta U. Samani
Rose Hendrix
Cameron Devine
Shashi M. Singh
Santosh Devasia
A. Banerjee
51
24
0
14 Feb 2019
Recurrent Neural Networks with Stochastic Layers for Acoustic Novelty
  Detection
Recurrent Neural Networks with Stochastic Layers for Acoustic Novelty Detection
Duong Nguyen
O. Kirsebom
F. Frazão
Ronan Fablet
Stan Matwin
45
5
0
13 Feb 2019
Capacity allocation analysis of neural networks: A tool for principled
  architecture design
Capacity allocation analysis of neural networks: A tool for principled architecture design
Jonathan Donier
49
4
0
12 Feb 2019
Unpriortized Autoencoder For Image Generation
Unpriortized Autoencoder For Image Generation
Jaeyoung Yoo
Hojun Lee
Nojun Kwak
SyDaSSLGANDRL
34
2
0
12 Feb 2019
Towards a Robust Deep Neural Network in Texts: A Survey
Towards a Robust Deep Neural Network in Texts: A Survey
Wenqi Wang
Benxiao Tang
Run Wang
Lina Wang
Aoshuang Ye
AAML
99
39
0
12 Feb 2019
MaCow: Masked Convolutional Generative Flow
MaCow: Masked Convolutional Generative Flow
Xuezhe Ma
Xiang Kong
Shanghang Zhang
Eduard H. Hovy
DRL
74
66
0
12 Feb 2019
Adversarial Generation of Time-Frequency Features with application in
  audio synthesis
Adversarial Generation of Time-Frequency Features with application in audio synthesis
Andrés Marafioti
Nicki Holighaus
Nathanael Perraudin
P. Majdak
74
68
0
11 Feb 2019
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data
Xiaohai Tian
Chng Eng Siong
Haizhou Li
36
7
0
11 Feb 2019
Data-Driven Vehicle Trajectory Forecasting
Data-Driven Vehicle Trajectory Forecasting
Shayan Jawed
Eya Boumaiza
Josif Grabocka
Lars Schmidt-Thieme
55
5
0
09 Feb 2019
Previous
123...505152...606162
Next