1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Action Segmentation with Mixed Temporal Domain Adaptation
Min-Hung Chen
Baopu Li
Yingze Bao
Ghassan AlRegib
120
30
0
15 Apr 2021
RIANN -- A Robust Neural Network Outperforms Attitude Estimation Filters
Daniel Weber
C. Gühmann
Thomas Seel
69
37
0
15 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration
V. Narayanaswamy
Jayaraman J. Thiagarajan
A. Spanias
AI4CE
55
5
0
14 Apr 2021
ADNet: Temporal Anomaly Detection in Surveillance Videos
H. Öztürk
Ahmet Burak Can
130
15
0
14 Apr 2021
Neural basis expansion analysis with exogenous variables: Forecasting electricity prices with NBEATSx
Kin G. Olivares
Cristian Challu
Grzegorz Marcjasz
R. Weron
A. Dubrawski
AI4TS
99
151
0
12 Apr 2021
Boltzmann Tuning of Generative Models
Victor Berger
Michele Sebag
57
0
0
12 Apr 2021
Protein sequence design with deep generative models
Zachary Wu
Kadina E. Johnston
F. Arnold
Kevin Kaichuang Yang
92
142
0
09 Apr 2021
The AS-NU System for the M2VoC Challenge
Cheng-Hung Hu
Yi-Chiao Wu
Wen-Chin Huang
Yu-Huai Peng
Yu-Wen Chen
Pin-Jui Ku
Tomoki Toda
Yu Tsao
Hsin-Min Wang
54
1
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
129
43
0
06 Apr 2021
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
136
107
0
06 Apr 2021
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
Junhyeok Lee
Seungu Han
DiffM
83
70
0
06 Apr 2021
Training Deep Normalizing Flow Models in Highly Incomplete Data Scenarios with Prior Regularization
Edgar A. Bernal
43
1
0
03 Apr 2021
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Myeonghun Jeong
Hyeongju Kim
Sung Jun Cheon
Byoung Jin Choi
N. Kim
DiffM
72
197
0
03 Apr 2021
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin-Hui Lee
134
22
0
02 Apr 2021
Learnable Dynamic Temporal Pooling for Time Series Classification
Dongha Lee
Seonghyeon Lee
Hwanjo Yu
AI4TS
92
29
0
02 Apr 2021
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Kang-Wook Kim
Seung-won Park
Junhyeok Lee
Myun-chul Joe
76
28
0
02 Apr 2021
Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Qing He
Zhiping Xiu
T. Koehler
Jilong Wu
75
7
0
01 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
147
318
0
01 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
66
4
0
01 Apr 2021
Collaborative Learning to Generate Audio-Video Jointly
V. Kurmi
Vipul Bajaj
Badri N. Patro
K. Venkatesh
Vinay P. Namboodiri
Preethi Jyothi
VGen
60
11
0
01 Apr 2021
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr Żelasko
Sonal Joshi
Yiwen Shao
Jesus Villalba
J. Trmal
Najim Dehak
Sanjeev Khudanpur
AAML
63
29
0
31 Mar 2021
Wave based damage detection in solid structures using artificial neural networks
Frank Wuttke
Hao Lyu
A. Sattari
Z. Rizvi
27
1
0
30 Mar 2021
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression
Shifeng Zhang
Chen Zhang
Ning Kang
Zhenguo Li
75
38
0
30 Mar 2021
Symbolic Music Generation with Diffusion Models
Gautam Mittal
Jesse Engel
Curtis Hawthorne
Ian Simon
MGen
DiffM
113
194
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
Shubham Tulsiani
Abhinav Gupta
76
17
0
29 Mar 2021
A Temporal Kernel Approach for Deep Learning with Continuous-time Information
Da Xu
Chuanwei Ruan
Evren Körpeoglu
Sushant Kumar
Kannan Achan
SyDa
AI4TS
54
5
0
28 Mar 2021
Improved Autoregressive Modeling with Distribution Smoothing
Chenlin Meng
Jiaming Song
Yang Song
Shengjia Zhao
Stefano Ermon
DiffM
76
23
0
28 Mar 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye Jia
Heiga Zen
Jonathan Shen
Yu Zhang
Yonghui Wu
SSL
103
84
0
28 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
95
14
0
27 Mar 2021
Online structural health monitoring by model order reduction and deep learning algorithms
Luca Rosafalco
Matteo Torzoni
Andrea Manzoni
S. Mariani
A. Corigliano
OffRL
34
49
0
26 Mar 2021
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
Cong Wang
Yu Chen
Bin Wang
Yi Shi
146
1
0
26 Mar 2021
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Nikhil Singh
Jeff Mentch
Jerry Ng
Matthew Beveridge
Iddo Drori
65
47
0
26 Mar 2021
Under Pressure: Learning to Detect Slip with Barometric Tactile Sensors
Abhinav Grover
C. Grebe
Philippe Nadeau
Jonathan Kelly
51
6
0
24 Mar 2021
Learned complex masks for multi-instrument source separation
Andreas Jansson
Rachel M. Bittner
N. Montecchio
Tillman Weyde
17
0
0
23 Mar 2021
CLIP: Cheap Lipschitz Training of Neural Networks
Leon Bungert
René Raab
Tim Roith
Leo Schwinn
Daniel Tenbrinck
59
33
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
81
16
0
22 Mar 2021
An Experimental Review on Deep Learning Architectures for Time Series Forecasting
Pedro Lara-Benítez
Manuel Carranza-García
José Cristóbal Riquelme Santos
AI4TS
109
318
0
22 Mar 2021
Learning physical properties of anomalous random walks using graph neural networks
Hippolyte Verdier
M. Duval
François Laurent
Alhassan Cassé
Christian L. Vestergaard
Jean-Baptiste Masson
71
25
0
22 Mar 2021
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German
Pelin Dogan-Schönberger
Julian Mäder
Thomas Hofmann
65
30
0
21 Mar 2021
Graph Attention Recurrent Neural Networks for Correlated Time Series Forecasting -- Full version
Razvan-Gabriel Cirstea
Chenjuan Guo
B. Yang
AI4TS
100
43
0
19 Mar 2021
GPNAS: A Neural Network Architecture Search Framework Based on Graphical Predictor
Dige Ai
Hong Zhang
AI4CE
53
0
0
19 Mar 2021
Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning
Siyang Yuan
Pengyu Cheng
Ruiyi Zhang
Weituo Hao
Zhe Gan
Lawrence Carin
DRL
74
61
0
17 Mar 2021
Lightweight and interpretable neural modeling of an audio distortion effect using hyperconditioned differentiable biquads
S. Nercessian
Andy M. Sarroff
K. Werner
40
29
0
15 Mar 2021
Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks
Chitralekha Gupta
Purnima Kamath
L. Wyse
60
9
0
12 Mar 2021
Latent Space Explorations of Singing Voice Synthesis using DDSP
J. Alonso
Cumhur Erkut
145
12
0
12 Mar 2021
Tensor networks and efficient descriptions of classical data
Sirui Lu
Márton Kanász-Nagy
I. Kukuljan
J. I. Cirac
54
26
0
11 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL
DRL
82
24
0
10 Mar 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
J. You
Dalhyun Kim
Gyuhyeon Nam
Geumbyeol Hwang
Gyeongsu Chae
68
27
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
193
511
0
08 Mar 2021
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Daxin Tan
Hingpang Huang
Guangyan Zhang
Tan Lee
65
6
0
08 Mar 2021