Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding
S. Tonekaboni
Danny Eytan
Anna Goldenberg
CML
SSL
AI4TS
171
298
0
01 Jun 2021
Enhancing Trajectory Prediction using Sparse Outputs: Application to Team Sports
Brandon Victor
Aiden Nibali
Zhen He
D. Carey
39
9
0
01 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
89
15
0
31 May 2021
StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts
Matthew Baas
Herman Kamper
53
6
0
31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection
Lu Ma
Xintian Wang
Song Yang
Y. Gong
Zhongqin Wu
47
1
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
279
1,246
0
30 May 2021
DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
Songxiang Liu
Yuewen Cao
Jane Polak Scowcroft
Helen Meng
DiffM
86
59
0
28 May 2021
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding
Neil Zeghidour
O. Teboul
David Grangier
63
13
0
28 May 2021
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles
Zhaoxuan Zhu
Nicola Pivaro
Shobhit Gupta
Abhishek Gupta
Marcello Canova
OffRL
65
37
0
25 May 2021
Inclusion of Domain-Knowledge into GNNs using Mode-Directed Inverse Entailment
T. Dash
A. Srinivasan
A. Baskar
72
13
0
22 May 2021
Spatial-temporal Conv-sequence Learning with Accident Encoding for Traffic Flow Prediction
Zichuan Liu
Rui Zhang
Chen Wang
Zhu Xiao
Hongbo Jiang
AI4TS
49
20
0
21 May 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters
Pritish Chandna
António Ramires
Xavier Serra
Emilia Gómez
61
4
0
21 May 2021
Temporal convolutional networks predict dynamic oxygen uptake response from wearable sensors across exercise intensities
Robert Amelard
E. Hedge
R. Hughson
26
18
0
20 May 2021
High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling
Patrick Lumban Tobing
Tomoki Toda
67
8
0
20 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection
S. Afanasiev
A. Smirnova
D. Kotereva
64
2
0
17 May 2021
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation
Shoule Wu
Ziqiang Shi
DiffM
157
11
0
17 May 2021
Drill the Cork of Information Bottleneck by Inputting the Most Important Data
Xinyu Peng
Jiawei Zhang
Feiyue Wang
Li Li
43
6
0
15 May 2021
Predicting speech intelligibility from EEG in a non-linear classification paradigm
Bernd Accou
Mohammad Jalilpour-Monesi
Hugo Van hamme
T. Francart
22
12
0
14 May 2021
Advances in Machine and Deep Learning for Modeling and Real-time Detection of Multi-Messenger Sources
Eliu A. Huerta
Zhizhen Zhao
106
21
0
13 May 2021
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Vadim Popov
Ivan Vovk
Vladimir Gogoryan
Tasnima Sadekova
Mikhail Kudinov
DiffM
119
544
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
496
8,017
0
11 May 2021
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents
Chaojun Xiao
Xueyu Hu
Zhiyuan Liu
Cunchao Tu
Maosong Sun
AILaw
ELM
104
245
0
09 May 2021
Machine Learning (ML)-Centric Resource Management in Cloud Computing: A Review and Future Directions
Tahseen Khan
Wenhong Tian
Rajkumar Buyya
58
106
0
09 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Liqiang He
Shulin Feng
Jane Polak Scowcroft
Dong Yu
54
0
0
08 May 2021
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu
Chengxi Li
Yi Ren
Feiyang Chen
Zhou Zhao
DiffM
198
271
0
06 May 2021
Non-Autoregressive vs Autoregressive Neural Networks for System Identification
Daniel Weber
C. Gühmann
58
7
0
05 May 2021
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Qijing Huang
Minwoo Kang
Grace Dinh
Thomas Norell
Aravind Kalaiah
J. Demmel
J. Wawrzynek
Y. Shao
72
112
0
05 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
J. Nistal
Cyran Aouameur
Stefan Lattner
G. Richard
105
7
0
04 May 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
69
8
0
03 May 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
91
49
0
28 Apr 2021
Learning deep autoregressive models for hierarchical data
Carl R. Andersson
Niklas Wahlström
Thomas B. Schon
BDL
57
3
0
28 Apr 2021
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
90
22
0
27 Apr 2021
Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Franco Pellegrini
Giulio Biroli
111
6
0
27 Apr 2021
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Rodrigo Mira
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Björn W. Schuller
Maja Pantic
121
47
0
27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
120
6
0
26 Apr 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Erica Cooper
Xin Wang
Junichi Yamagishi
91
6
0
25 Apr 2021
Restoring degraded speech via a modified diffusion model
Jianwei Zhang
Suren Jayasuriya
Visar Berisha
DiffM
64
21
0
22 Apr 2021
Scaling of neural-network quantum states for time evolution
Sheng-Hsuan Lin
F. Pollmann
66
25
0
21 Apr 2021
Lossless Compression with Latent Variable Models
James Townsend
BDL
DRL
78
6
0
21 Apr 2021
Eye Know You: Metric Learning for End-to-end Biometric Authentication Using Eye Movements from a Longitudinal Dataset
Dillon Lohr
Henry K. Griffith
Oleg V. Komogortsev
89
33
0
21 Apr 2021
Superpixels and Graph Convolutional Neural Networks for Efficient Detection of Nutrient Deficiency Stress from Aerial Imagery
Saba Dadsetan
David Pichler
David Wilson
N. Hovakimyan
Jennifer Hobbs
99
6
0
20 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
345
513
0
20 Apr 2021
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM
ALM
94
25
0
20 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
Šimon Mandlík
Tomás Pevný
52
5
0
19 Apr 2021
Recursive input and state estimation: A general framework for learning from time series with missing data
Alberto García-Durán
Robert West
AI4TS
30
2
0
17 Apr 2021
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Saida Mussakhojayeva
Aigerim Janaliyeva
A. Mirzakhmetov
Yerbolat Khassanov
H. A. Varol
61
14
0
17 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
97
202
0
16 Apr 2021
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning
Narendra Chaudhary
Sanchit Misra
Dhiraj D. Kalamkar
A. Heinecke
E. Georganas
Barukh Ziv
Menachem Adelman
Bharat Kaul
61
9
0
16 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
61
11
0
15 Apr 2021
Previous
1
2
3
...
30
31
32
...
60
61
62
Next