1609.03499
Cited By
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing "WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Semi-Recurrent CNN-based VAE-GAN for Sequential Data Generation
Mohammad Akbari
Jie Liang
GAN
75
20
0
01 Jun 2018
Backpropagation for Implicit Spectral Densities
Aditya A. Ramesh
Yann LeCun
55
10
0
01 Jun 2018
Inverting Supervised Representations with Autoregressive Neural Density Models
C. Nash
Nate Kushman
Christopher K. I. Williams
DRL
64
25
0
01 Jun 2018
Mining gold from implicit models to improve likelihood-free inference
Johann Brehmer
Gilles Louppe
J. Pavez
Kyle Cranmer
AI4CE
TPM
188
181
0
30 May 2018
Theory and Experiments on Vector Quantized Autoencoders
Aurko Roy
Ashish Vaswani
Arvind Neelakantan
Niki Parmar
91
88
0
28 May 2018
Lipschitz regularity of deep neural networks: analysis and efficient estimation
Kevin Scaman
Aladin Virmaux
158
533
0
28 May 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis
L. Wyse
38
9
0
28 May 2018
Stable Recurrent Models
John Miller
Moritz Hardt
83
119
0
25 May 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach
Noé Tits
Kevin El Haddad
Thierry Dutoit
65
28
0
23 May 2018
CNN+CNN: Convolutional Decoders for Image Captioning
Qingzhong Wang
Antoni B. Chan
VLM
73
86
0
23 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics
P. Esling
Axel Chemla-Romeu-Santos
Adrien Bitton
52
32
0
22 May 2018
Meta-learning with differentiable closed-form solvers
Luca Bertinetto
João F. Henriques
Philip Torr
Andrea Vedaldi
ODL
123
932
0
21 May 2018
A Universal Music Translation Network
Noam Mor
Lior Wolf
Adam Polyak
Yaniv Taigman
89
110
0
21 May 2018
An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
108
71
0
19 May 2018
The global optimum of shallow neural network is attained by ridgelet transform
Sho Sonoda
Isao Ishikawa
Masahiro Ikeda
Kei Hagihara
Y. Sawano
Takuo Matsubara
Noboru Murata
35
1
0
19 May 2018
Number Sequence Prediction Problems for Evaluating Computational Powers of Neural Networks
Hyoungwook Nam
Segwang Kim
Kyomin Jung
AIMat
66
15
0
19 May 2018
Sequential Neural Likelihood: Fast Likelihood-free Inference with Autoregressive Flows
George Papamakarios
D. Sterratt
Iain Murray
BDL
552
370
0
18 May 2018
QuaterNet: A Quaternion-based Recurrent Model for Human Motion
Dario Pavllo
David Grangier
Michael Auli
3DH
76
263
0
16 May 2018
Towards a universal neural network encoder for time series
Joan Serrà
Santiago Pascual
Alexandros Karatzoglou
AI4TS
84
123
0
10 May 2018
Intracranial Error Detection via Deep Learning
M. Völker
Jiří Hammer
R. Schirrmeister
Joos Behncke
L. Fiederer
A. Schulze-Bonhage
Petr Marusič
Wolfram Burgard
T. Ball
65
10
0
04 May 2018
Randomly weighted CNNs for (music) audio classification
Jordi Pons
Xavier Serra
79
86
0
01 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
Tomoki Toda
65
25
0
30 Apr 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition
Albert Haque
Corinna Fukushima
21
0
0
30 Apr 2018
Deep Speech Denoising with Vector Space Projections
Jeff Hetherly
Paul Gamble
M. Barrios
Cory Stephenson
Karl S. Ni
32
0
0
27 Apr 2018
Detection of Glottal Closure Instants from Raw Speech using Convolutional Neural Networks
Mohit Goyal
Varun Srivastava
Prathosh A. P.
40
2
0
26 Apr 2018
JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics
Anders Andreassen
Ilya Feige
Christopher Frye
M. Schwartz
MU
100
137
0
25 Apr 2018
Speaker-independent raw waveform model for glottal excitation
Lauri Juvela
Vassilis Tsiaras
Bajibabu Bollepalli
Manu Airaksinen
Junichi Yamagishi
P. Alku
54
39
0
25 Apr 2018
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment
Tomi Kinnunen
Jaime Lorenzo-Trueba
Junichi Yamagishi
Tomoki Toda
Daisuke Saito
F. Villavicencio
Zhenhua Ling
51
28
0
23 Apr 2018
Deep Layered Learning in MIR
Anders Elowsson
46
4
0
18 Apr 2018
The unreasonable effectiveness of the forget gate
J. Westhuizen
Joan Lasenby
77
89
0
13 Apr 2018
Blood Vessel Geometry Synthesis using Generative Adversarial Networks
J. Wolterink
T. Leiner
Ivana Isgum
GAN
MedIm
43
25
0
12 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
Tomoki Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
69
321
0
12 Apr 2018
Understanding disentangling in β-VAE
Christopher P. Burgess
I. Higgins
Arka Pal
Loic Matthey
Nicholas Watters
Guillaume Desjardins
Alexander Lerchner
CoGe
DRL
73
832
0
10 Apr 2018
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
Xin Wang
Jaime Lorenzo-Trueba
Shinji Takaki
Lauri Juvela
Junichi Yamagishi
70
67
0
07 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
87
139
0
06 Apr 2018
Structured Disentangled Representations
Babak Esmaeili
Hao Wu
Sarthak Jain
Alican Bozkurt
N. Siddharth
Brooks Paige
Dana H. Brooks
Jennifer Dy
Jan-Willem van de Meent
OOD
CML
BDL
DRL
88
169
0
06 Apr 2018
Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Xinpeng Chen
Jingyuan Chen
Lin Ma
Jian Yao
Wen Liu
Jiebo Luo
Tong Zhang
AI4TS
VGen
37
20
0
04 Apr 2018
Music Genre Classification using Machine Learning Techniques
Hareesh Bahuleyan
VLM
34
104
0
03 Apr 2018
Neural Autoregressive Flows
Chin-Wei Huang
David M. Krueger
Alexandre Lacoste
Aaron Courville
DRL
AI4CE
161
447
0
03 Apr 2018
Conditional End-to-End Audio Transforms
Albert Haque
Michelle Guo
Prateek Verma
114
41
0
30 Mar 2018
Parallel Grid Pooling for Data Augmentation
Akito Takeki
Daiki Ikami
Go Irie
Kiyoharu Aizawa
56
7
0
30 Mar 2018
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data
Tifani Warnita
Nakamasa Inoue
Koichi Shinoda
36
40
0
30 Mar 2018
Machine Speech Chain with One-shot Speaker Adaptation
Andros Tjandra
S. Sakti
Satoshi Nakamura
71
56
0
28 Mar 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
202
1,102
0
27 Mar 2018
Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra
Toru Nakashika
Shinji Takaki
Junichi Yamagishi
13
1
0
27 Mar 2018
MOrdReD: Memory-based Ordinal Regression Deep Neural Networks for Time Series Forecasting
Bernardo Pérez Orozco
G. Abbati
Stephen J. Roberts
OOD
AI4TS
38
14
0
26 Mar 2018
HAMLET: Interpretable Human And Machine co-LEarning Technique
Olivier Deiss
Siddharth Biswal
Jing Jin
Haoqi Sun
M. P. M. Brandon Westover
Jimeng Sun
60
11
0
26 Mar 2018
Calibrated Prediction Intervals for Neural Network Regressors
Gil Keren
N. Cummins
Björn Schuller
UQCV
87
31
0
26 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Y. Xiao
Fei Ren
Ye Jia
Rif A. Saurous
68
827
0
23 Mar 2018
Generalization Challenges for Neural Architectures in Audio Source Separation
Shariq Mobin
Brian Cheung
Bruno A. Olshausen
DRL
46
2
0
23 Mar 2018