Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Autoencoding sensory substitution
Viktor Tóth
L. Parkkonen
35
6
0
14 Jul 2019
Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
Stefano Ermon
SyDa
DiffM
273
3,972
0
12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer
Z. Wang
Yao Ma
Zitao Liu
Jiliang Tang
ViT
82
106
0
12 Jul 2019
On the Evaluation of Conditional GANs
Terrance Devries
Adriana Romero
Luis Villaseñor-Pineda
Graham W. Taylor
M. Drozdzal
EGVM
87
43
0
11 Jul 2019
Multi-Speaker End-to-End Speech Synthesis
Jihyun Park
Kexin Zhao
Kainan Peng
Ming-Yu Liu
SyDa
74
19
0
09 Jul 2019
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Zhiwen Chen
RJ Skerry-Ryan
Ye Jia
Andrew Rosenberg
Bhuvana Ramabhadran
76
189
0
09 Jul 2019
M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention
Shuang Ma
Daniel J. McDuff
Yale Song
44
4
0
09 Jul 2019
Towards Debugging Deep Neural Networks by Generating Speech Utterances
Bilal Soomro
Anssi Kanervisto
Trung Ngo Trong
Ville Hautamaki
21
0
0
06 Jul 2019
Speech bandwidth extension with WaveNet
Archit Gupta
Brendan Shillingford
Yannis Assael
Thomas C. Walters
60
29
0
05 Jul 2019
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach
Noé Tits
40
10
0
05 Jul 2019
Neural Drum Machine : An Interactive System for Real-time Synthesis of Drum Sounds
Cyran Aouameur
P. Esling
Gaëtan Hadjeres
42
22
0
04 Jul 2019
The Indirect Convolution Algorithm
Marat Dukhan
68
42
0
03 Jul 2019
Multitasking with Alexa Multitasking with Alexa: How Using Intelligent Personal Assistants Impacts Language-based Primary Task Performance
Justin Edwards
H. Liu
Tianyu Zhou
Sandy J. J. Gould
L. Clark
Philip R. Doyle
Benjamin R. Cowan
37
23
0
03 Jul 2019
Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances
Jie Jiang
Qiuqiang Kong
Mark D. Plumbley
Nigel Gilbert
61
59
0
03 Jul 2019
Generative Models for Automatic Chemical Design
Daniel Schwalbe-Koda
Rafael Gómez-Bombarelli
MedIm
AI4CE
87
81
0
02 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
80
185
0
02 Jul 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
Gabriel Meseguer-Brocal
Geoffroy Peeters
84
61
0
02 Jul 2019
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks
Jibin Wu
Yansong Chua
Malu Zhang
Guoqi Li
Haizhou Li
Kay Chen Tan
78
14
0
02 Jul 2019
Adaptive Music Composition for Games
P. Hutchings
Jon McCormack
65
29
0
02 Jul 2019
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
Tomoki Toda
63
16
0
01 Jul 2019
Analysis by Adversarial Synthesis -- A Novel Approach for Speech Vocoding
Ahmed Mustafa
A. Biswas
Christian Bergler
Julia Schottenhamml
Andreas Maier
GAN
50
4
0
01 Jul 2019
Deep Residual Neural Networks for Audio Spoofing Detection
M. Alzantot
Ziqi Wang
Mani B. Srivastava
77
169
0
30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
204
1,451
0
29 Jun 2019
Curriculum Learning for Deep Generative Models with Clustering
Deli Zhao
Jiapeng Zhu
Zhenfang Guo
Bo Zhang
GNN
85
2
0
27 Jun 2019
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis
Lenar Gabdrakhmanov
Rustem Garaev
E. Razinkov
52
10
0
26 Jun 2019
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training
Peng Wu
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Hong-Chuan Wu
Lirong Dai
98
72
0
26 Jun 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
100
99
0
25 Jun 2019
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
Shreyas Seshadri
Okko Räsänen
23
10
0
24 Jun 2019
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis
Yang Ai
Zhenhua Ling
123
29
0
23 Jun 2019
Universal Approximation of Input-Output Maps by Temporal Convolutional Nets
Joshua Hanson
Maxim Raginsky
AI4TS
62
6
0
21 Jun 2019
Black-Box Inference for Non-Linear Latent Force Models
W. Ward
Tom Ryder
D. Prangle
Mauricio A. Alvarez
DRL
80
14
0
21 Jun 2019
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling
Yuanhao Yi
Yang Ai
Zhenhua Ling
Lirong Dai
56
33
0
21 Jun 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders
Yin-Jyun Luo
Kat R. Agres
Dorien Herremans
103
46
0
19 Jun 2019
Disentangled Inference for GANs with Latently Invertible Autoencoder
Jiapeng Zhu
Deli Zhao
Bo Zhang
Bolei Zhou
GAN
DRL
109
35
0
19 Jun 2019
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Hieu-Thi Luong
Junichi Yamagishi
74
10
0
18 Jun 2019
Pose Guided Fashion Image Synthesis Using Deep Generative Model
Wei Sun
Jawadul H. Bappy
Shanglin Yang
Yi Tian Xu
Tianfu Wu
Hui Zhou
56
12
0
17 Jun 2019
Learning Execution through Neural Code Fusion
Zhan Shi
Kevin Swersky
Daniel Tarlow
Parthasarathy Ranganathan
Milad Hashemi
GNN
116
29
0
17 Jun 2019
ASAC: Active Sensing using Actor-Critic models
Jinsung Yoon
James Jordon
M. Schaar
CML
59
16
0
16 Jun 2019
Parametric Resynthesis with neural vocoders
Soumi Maiti
Michael I. Mandel
68
19
0
16 Jun 2019
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
173
1,217
0
13 Jun 2019
GluonTS: Probabilistic Time Series Models in Python
A. Alexandrov
Konstantinos Benidis
Michael Bohlke-Schneider
Valentin Flunkert
Jan Gasthaus
...
David Salinas
J. Schulz
Lorenzo Stella
Ali Caner Türkmen
Bernie Wang
BDL
AI4TS
75
115
0
12 Jun 2019
Toward Interpretable Music Tagging with Self-Attention
Minz Won
Sanghyuk Chun
Xavier Serra
ViT
74
82
0
12 Jun 2019
Probabilistic Forecasting with Temporal Convolutional Neural Network
Yitian Chen
Yanfei Kang
Yixiong Chen
Zizhuo Wang
BDL
AI4TS
119
331
0
11 Jun 2019
Parallel Scheduled Sampling
Daniel Duckworth
Arvind Neelakantan
Ben Goodrich
Lukasz Kaiser
Samy Bengio
77
23
0
11 Jun 2019
Neural Spline Flows
Conor Durkan
Artur Bekasov
Iain Murray
George Papamakarios
DRL
236
778
0
10 Jun 2019
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis
Eric Battenberg
Soroosh Mariooryad
Daisy Stanton
RJ Skerry-Ryan
Matt Shannon
David Kao
Tom Bagby
BDL
107
45
0
08 Jun 2019
TransNet: A deep network for fast detection of common shot transitions
Tomás Soucek
Jaroslav Moravec
Jakub Lokoč
42
31
0
08 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
96
65
0
04 Jun 2019
Effective LHC measurements with matrix elements and machine learning
Johann Brehmer
Kyle Cranmer
Irina Espejo
F. Kling
Gilles Louppe
J. Pavez
81
14
0
04 Jun 2019
Text-based Editing of Talking-head Video
Ohad Fried
A. Tewari
Michael Zollhöfer
Adam Finkelstein
Eli Shechtman
Dan B. Goldman
Kyle Genova
Zeyu Jin
Christian Theobalt
Maneesh Agrawala
VGen
110
262
0
04 Jun 2019
Previous
1
2
3
...
47
48
49
...
60
61
62
Next