1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
PixelSNAIL: An Improved Autoregressive Generative Model
Xi Chen
Nikhil Mishra
Mostafa Rohaninejad
Pieter Abbeel
DRL
DiffM
BDL
GAN
80
276
0
28 Dec 2017
CNN Is All You Need
Qiming Chen
R. Wu
AIMat
17
17
0
27 Dec 2017
Towards Structured Analysis of Broadcast Badminton Videos
Anurag Ghosh
Suriya Singh
C. V. Jawahar
48
46
0
23 Dec 2017
On Using Backpropagation for Speech Texture Generation and Voice Conversion
J. Chorowski
Ron J. Weiss
Rif A. Saurous
Samy Bengio
43
19
0
22 Dec 2017
Adversarial Examples: Attacks and Defenses for Deep Learning
Xiaoyong Yuan
Pan He
Qile Zhu
Xiaolin Li
SILM
AAML
156
1,629
0
19 Dec 2017
Dynamic Weight Alignment for Temporal Convolutional Neural Networks
Brian Kenji Iwana
S. Uchida
AI4TS
59
8
0
18 Dec 2017
Generating and designing DNA with deep generative models
N. Killoran
Leo J. Lee
Andrew Delong
David Duvenaud
B. Frey
AI4CE
63
147
0
17 Dec 2017
Deep Learning for Distant Speech Recognition
Mirco Ravanelli
79
16
0
17 Dec 2017
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
101
2,707
0
16 Dec 2017
A Hierarchical Recurrent Neural Network for Symbolic Melody Generation
Jian Wu
Changran Hu
Yulong Wang
Xiaolin Hu
Jun Zhu
75
84
0
14 Dec 2017
DLR : Toward a deep learned rhythmic representation for music content analysis
Yeon-ju Jeong
Keunwoo Choi
Hosan Jeong
114
4
0
14 Dec 2017
Motion Switching with Sensory and Instruction Signals by designing Dynamical Systems using Deep Neural Network
Kanata Suzuki
Hiroki Mori
T. Ogata
52
20
0
14 Dec 2017
Over the Air Deep Learning Based Radio Signal Classification
Tim O'Shea
Tamoghna Roy
T. Clancy
78
1,096
0
13 Dec 2017
Music Generation by Deep Learning - Challenges and Directions
Jean-Pierre Briot
F. Pachet
MGen
99
131
0
09 Dec 2017
Learning to Fuse Music Genres with Generative Adversarial Dual Learning
Zhiqian Chen
Chih-Wei Wu
Yen-Cheng Lu
Alexander Lerch
Chang-Tien Lu
GAN
55
8
0
05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
98
209
0
04 Dec 2017
Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection
Ruimin Sun
Xiaoyong Yuan
Pan He
Qile Zhu
Aokun Chen
André Grégio
Daniela Oliveira
Xiaolin Li
AAML
42
11
0
04 Dec 2017
Spatial PixelCNN: Generating Images from Patches
Nader Akoury
Anh Totti Nguyen
62
4
0
03 Dec 2017
DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning
Mengwei Xu
Feng Qian
Mengze Zhu
Feifan Huang
Saumay Pushp
Xuanzhe Liu
71
23
0
01 Dec 2017
Utilizing Domain Knowledge in End-to-End Audio Processing
T. M. S. Tax
J. Antich
Hendrik Purwins
Lars Maaløe
19
8
0
01 Dec 2017
Wavenet based low rate speech coding
W. Kleijn
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Florian Stimberg
Quan Wang
Thomas C. Walters
61
143
0
01 Dec 2017
Time Domain Neural Audio Style Transfer
P. Mital
60
12
0
29 Nov 2017
A Multi-Horizon Quantile Recurrent Forecaster
Ruofeng Wen
Kari Torkkola
Balakrishnan Narayanaswamy
Dhruv Madeka
BDL
AI4TS
65
433
0
29 Nov 2017
TensorFlow Distributions
Joshua V. Dillon
I. Langmore
Dustin Tran
E. Brevdo
Srinivas Vasudevan
David A. Moore
Brian Patton
Alexander A. Alemi
Matt Hoffman
Rif A. Saurous
GP
123
352
0
28 Nov 2017
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord
Yazhe Li
Igor Babuschkin
Karen Simonyan
Oriol Vinyals
...
Alex Graves
Helen King
T. Walters
Dan Belov
Demis Hassabis
233
859
0
28 Nov 2017
Quantifying the Effects of Enforcing Disentanglement on Variational Autoencoders
Momchil Peychev
Petar Velickovic
Pietro Lio
CoGe
DRL
86
0
0
24 Nov 2017
Invariance of Weight Distributions in Rectified MLPs
Russell Tsuchida
Farbod Roosta-Khorasani
M. Gallagher
MLT
120
36
0
24 Nov 2017
Non-local Neural Networks
Xiaolong Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
366
8,948
0
21 Nov 2017
JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music with LSTMs
Gino Brunner
Yuyi Wang
Roger Wattenhofer
Jonas Wiesendanger
MGen
66
49
0
21 Nov 2017
Speech Dereverberation with Context-aware Recurrent Neural Networks
J. F. Santos
T. Falk
47
36
0
16 Nov 2017
How Generative Adversarial Networks and Their Variants Work: An Overview
Yongjun Hong
Uiwon Hwang
Jaeyoon Yoo
Sungroh Yoon
GAN
131
159
0
16 Nov 2017
Emotional End-to-End Neural Speech Synthesizer
Younggun Lee
Azam Rabiee
Soo-Young Lee
93
106
0
15 Nov 2017
Non-Autoregressive Neural Machine Translation
Jiatao Gu
James Bradbury
Caiming Xiong
Victor O.K. Li
R. Socher
107
798
0
07 Nov 2017
Convolutional Normalizing Flows
Guoqing Zheng
Yiming Yang
J. Carbonell
BDL
72
11
0
07 Nov 2017
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning
Zhen He
Shaobing Gao
Liang Xiao
Daxue Liu
Hangen He
David Barber
AIMat
92
65
0
05 Nov 2017
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour
Nicolas Usunier
Iasonas Kokkinos
Thomas Schatz
Gabriel Synnaeve
Emmanuel Dupoux
76
120
0
03 Nov 2017
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
259
5,093
0
02 Nov 2017
Hi, how can I help you?: Automating enterprise IT support help desks
Senthil Mani
Neelamadhav Gantayat
Rahul Aralikatte
Monika Gupta
Sampath Dechu
A. Sankaran
Shreya Khare
Barry Mitchell
H. Subramanian
Hema Venkatarangan
47
13
0
02 Nov 2017
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
111
633
0
01 Nov 2017
Uncovering Latent Style Factors for Expressive Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Y. Xiao
Daisy Stanton
Joel Shor
Eric Battenberg
R. Clark
Rif A. Saurous
63
53
0
01 Nov 2017
Melody Generation for Pop Music via Word Representation of Musical Properties
Andrew Shin
Léopold Crestel
Hiroharu Kato
Kuniaki Saito
Katsunori Ohnishi
Masataka Yamaguchi
Masahiro Nakawaki
Yoshitaka Ushiku
Tatsuya Harada
MGen
72
12
0
31 Oct 2017
Audio style transfer
Eric Grinstein
Ngoc Q. K. Duong
A. Ozerov
P. Pérez
71
68
0
31 Oct 2017
Onsets and Frames: Dual-Objective Piano Transcription
Curtis Hawthorne
Erich Elsen
Jialin Song
Adam Roberts
Ian Simon
Colin Raffel
Jesse Engel
Sageev Oore
Douglas Eck
204
281
0
30 Oct 2017
JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis
Ryosuke Sonobe
Shinnosuke Takamichi
Hiroshi Saruwatari
3DV
105
140
0
28 Oct 2017
Multi-level Residual Networks from Dynamical Systems View
B. Chang
Lili Meng
E. Haber
Frederick Tung
David Begert
96
172
0
27 Oct 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
233
7,395
0
27 Oct 2017
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
89
549
0
25 Oct 2017
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
Hideyuki Tachibana
Katsuya Uenoyama
Shunsuke Aihara
76
267
0
24 Oct 2017
Listening to the World Improves Speech Command Recognition
B. McMahan
D. Rao
102
38
0
23 Oct 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Wei Ping
Kainan Peng
Andrew Gibiansky
Sercan O. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
102
309
0
20 Oct 2017