Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Learning Gradient Fields for Shape Generation
Ruojin Cai
Guandao Yang
Hadar Averbuch-Elor
Jinwei Gu
Serge J. Belongie
Noah Snavely
B. Hariharan
3DPC
134
286
0
14 Aug 2020
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
81
8
0
13 Aug 2020
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
61
8
0
13 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
77
81
0
11 Aug 2020
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems
Ravichander Vipperla
Sangjun Park
Kihyun Choo
Samin S. Ishtiaq
Kyoungbo Min
S. Bhattacharya
Abhinav Mehrotra
Alberto Gil C. P. Ramos
Nicholas D. Lane
72
26
0
11 Aug 2020
DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Guillaume Alain
Maxime Chevalier-Boisvert
Frédéric Osterrath
Remi Piche-Taillefer
HAI
81
6
0
10 Aug 2020
Deep learning for photoacoustic imaging: a survey
Changchun Yang
Hengrong Lan
Feng Gao
Fei Gao
VLM
MedIm
59
21
0
10 Aug 2020
SpeedySpeech: Efficient Neural Speech Synthesis
Jan Vainer
Ondrej Dusek
66
43
0
09 Aug 2020
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions
D. Paul
Yannis Pantazis
Y. Stylianou
DRL
59
30
0
09 Aug 2020
Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Yeunju Choi
Youngmoon Jung
Hoirin Kim
139
26
0
09 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
141
329
0
09 Aug 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
121
40
0
07 Aug 2020
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment
Behnoosh Parsa
A. Banerjee
97
2
0
07 Aug 2020
DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System
Liqiang Zhang
Chengzhu Yu
Heng Lu
Chao Weng
Chunlei Zhang
Yusong Wu
Xiang Xie
Zijin Li
Dong Yu
68
34
0
07 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak
Lior Wolf
Yossi Adi
Yaniv Taigman
58
44
0
06 Aug 2020
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Xuan Li
Xun Huang
Jiahui Yu
Ting-Chun Wang
Arun Mallya
GAN
170
155
0
06 Aug 2020
HooliGAN: Robust, High Quality Neural Vocoding
Ollie McCarthy
Zo Ahmed
98
14
0
06 Aug 2020
Zero-Shot Multi-View Indoor Localization via Graph Location Networks
Meng-Jiun Chiou
Zhenguang Liu
Yifang Yin
Anan Liu
Roger Zimmermann
64
27
0
06 Aug 2020
PPSpeech: Phrase based Parallel End-to-End TTS System
Yahuan Cong
Ran Zhang
Jian Luan
47
3
0
06 Aug 2020
Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
Tamás Gábor Csapó
Csaba Zainkó
L. Tóth
G. Gosztolya
Alexandra Markó
61
33
0
06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
81
6
0
05 Aug 2020
Expressive TTS Training with Frame and Style Reconstruction Loss
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
112
73
0
04 Aug 2020
Neural Granular Sound Synthesis
Adrien Bitton
P. Esling
Tatsuya Harada
83
7
0
04 Aug 2020
A Spectral Energy Distance for Parallel Speech Synthesis
A. Gritsenko
Tim Salimans
Rianne van den Berg
Jasper Snoek
Nal Kalchbrenner
76
70
0
03 Aug 2020
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
Tomás Nekvinda
Ondrej Dusek
72
57
0
03 Aug 2020
Unacceptable, where is my privacy? Exploring Accidental Triggers of Smart Speakers
Lea Schonherr
Maximilian Golla
Thorsten Eisenhofer
Jan Wiele
D. Kolossa
Thorsten Holz
63
41
0
02 Aug 2020
Principles and Algorithms for Forecasting Groups of Time Series: Locality and Globality
Pablo Montero-Manso
Rob J. Hyndman
AI4TS
102
139
0
02 Aug 2020
Diet deep generative audio models with structured lottery
P. Esling
Ninon Devis
Adrien Bitton
Antoine Caillon
Axel Chemla-Romeu-Santos
Constance Douwes
105
6
0
31 Jul 2020
Traffic Control Gesture Recognition for Autonomous Vehicles
Julian Wiederer
Arij Bouazizi
U. Kressel
Vasileios Belagiannis
95
53
0
31 Jul 2020
An Empirical Survey of Data Augmentation for Time Series Classification with Neural Networks
Brian Kenji Iwana
S. Uchida
AI4TS
90
506
0
31 Jul 2020
Rewriting a Deep Generative Model
David Bau
Steven Liu
Tongzhou Wang
Jun-Yan Zhu
Antonio Torralba
GAN
DRL
108
140
0
30 Jul 2020
Anomaly Detection at Scale: The Case for Deep Distributional Time Series Models
Fadhel Ayed
Lorenzo Stella
Tim Januschowski
Jan Gasthaus
AI4TS
103
10
0
30 Jul 2020
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok Yang
Junmo Lee
Young-Ik Kim
Hoonyoung Cho
Injung Kim
85
73
0
30 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
130
58
0
29 Jul 2020
TrajGAIL: Generating Urban Vehicle Trajectories using Generative Adversarial Imitation Learning
Seongjin Choi
Jiwon Kim
H. Yeo
GAN
100
131
0
28 Jul 2020
Federated Self-Supervised Learning of Multi-Sensor Representations for Embedded Intelligence
Aaqib Saeed
Flora D. Salim
T. Ozcelebi
J. Lukkien
FedML
SSL
193
100
0
25 Jul 2020
Towards Game Design via Creative Machine Learning (GDCML)
Anurag Sarkar
Seth Cooper
AI4CE
72
21
0
25 Jul 2020
Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network
Ravi Shankar
Hsi-Wei Hsieh
N. Charon
A. Venkataraman
120
11
0
25 Jul 2020
Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator
Ravi Shankar
Jacob Sager
A. Venkataraman
GAN
117
20
0
25 Jul 2020
Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation
Xiaoyuan Yi
Hyeonseung Lee
Wenhao Li
Hyung Yong Kim
Nam Soo Kim
84
22
0
25 Jul 2020
Hierarchical Protein Function Prediction with Tail-GNNs
Stefan Spalević
Petar Velivcković
Jovana Kovavcević
Mladen Nikolic
AI4CE
66
5
0
24 Jul 2020
SeismoFlow -- Data augmentation for the class imbalance problem
R. Milidiú
Luis Müller
AI4TS
30
1
0
23 Jul 2020
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture
Fady K. Fahmy
M. Khalil
Hazem M. Abbas
55
21
0
22 Jul 2020
CrossTransformers: spatially-aware few-shot transfer
Carl Doersch
Ankush Gupta
Andrew Zisserman
ViT
294
338
0
22 Jul 2020
Rethinking CNN Models for Audio Classification
Kamalesh Palanisamy
Dipika Singhania
Angela Yao
SSL
83
146
0
22 Jul 2020
Foley Music: Learning to Generate Music from Videos
Chuang Gan
Deng Huang
Peihao Chen
J. Tenenbaum
Antonio Torralba
VGen
75
139
0
21 Jul 2020
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
S. Rehman
M. Waqas
Shanshan Tu
Anis Koubaa
O. Rehman
Jawad Ahmad
Muhammad Hanif
Zhu Han
49
6
0
21 Jul 2020
Complex Skill Acquisition Through Simple Skill Imitation Learning
Pranay Pasula
8
0
0
20 Jul 2020
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
Emma Rocheteau
Pietro Lio
Stephanie L. Hyland
OOD
64
60
0
18 Jul 2020
MTL2L: A Context Aware Neural Optimiser
N. Kuo
Mehrtash Harandi
Nicolas Fourrier
Christian J. Walder
Gabriela Ferraro
H. Suominen
36
0
0
18 Jul 2020
Previous
1
2
3
...
37
38
39
...
60
61
62
Next