Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
355
7,541
0
06 Oct 2020
JSSS: free Japanese speech corpus for summarization and simplification
Shinnosuke Takamichi
Mamoru Komachi
Naoko Tanji
Hiroshi Saruwatari
32
1
0
05 Oct 2020
D3Net: Densely connected multidilated DenseNet for music source separation
Naoya Takahashi
Yuki Mitsufuji
MedIm
183
70
0
05 Oct 2020
VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models
Zhisheng Xiao
Karsten Kreis
Jan Kautz
Arash Vahdat
120
124
0
01 Oct 2020
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson Eisenach
Yagna Patel
Dhruv Madeka
AI4TS
107
37
0
30 Sep 2020
Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text
Shaoxiong Ji
Min Zhang
Pekka Marttinen
MedIm
68
35
0
30 Sep 2020
Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data
Mingyang Zhang
Yi Zhou
Li Zhao
Haizhou Li
92
53
0
30 Sep 2020
Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network
Xing Wang
Yijun Wang
Bin Weng
Aleksandr Vinel
AIFin
AI4TS
77
11
0
29 Sep 2020
Lip-reading with Densely Connected Temporal Convolutional Networks
Pingchuan Ma
Yujiang Wang
Jie Shen
Stavros Petridis
Maja Pantic
83
58
0
29 Sep 2020
Variational Temporal Deep Generative Model for Radar HRRP Target Recognition
D. Guo
Bo Chen
Wenchao Chen
Changbo Wang
Hongwei Liu
Mingyuan Zhou
BDL
88
36
0
28 Sep 2020
Recognition and Synthesis of Object Transport Motion
Connor Daly
GAN
16
0
0
27 Sep 2020
Iterative Reconstruction for Low-Dose CT using Deep Gradient Priors of Generative Model
Zhuonan He
Yikun Zhang
Yu Guan
S. Niu
Yi Zhang
Yang Chen
Qiegen Liu
DiffM
MedIm
93
12
0
27 Sep 2020
N-BEATS neural network for mid-term electricity load forecasting
Boris N. Oreshkin
Grzegorz Dudek
Paweł Pełka
Ekaterina Turkina
AI4TS
50
84
0
24 Sep 2020
Haar Wavelet based Block Autoregressive Flows for Trajectories
Apratim Bhattacharyya
C. Straehle
Mario Fritz
Bernt Schiele
AI4TS
70
15
0
21 Sep 2020
FakeTagger: Robust Safeguards against DeepFake Dissemination via Provenance Tracking
Run Wang
Felix Juefei Xu
Mengqing Luo
Yang Liu
Lina Wang
116
76
0
21 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
253
1,472
0
21 Sep 2020
Shimon the Rapper: A Real-Time System for Human-Robot Interactive Rap Battles
Richard J. Savery
Lisa Zahray
Gil Weinberg
54
17
0
19 Sep 2020
GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning
Chang-rui Liu
Huichu Zhang
Weinan Zhang
Guanjie Zheng
Yong Yu
48
45
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
Robert Susik
AI4TS
18
6
0
15 Sep 2020
Controllable neural text-to-speech synthesis using intuitive prosodic features
T. Raitio
Ramya Rasipuram
D. Castellani
78
66
0
14 Sep 2020
Adaptive Convolution Kernel for Artificial Neural Networks
F. B. Tek
Ilker Çam
D. Karli
33
14
0
14 Sep 2020
Visual-speech Synthesis of Exaggerated Corrective Feedback
Yaohua Bu
Weijun Li
Tianyi Ma
S. Chen
Jia Jia
Kun Li
Xiaobo Lu
45
1
0
12 Sep 2020
Activation Relaxation: A Local Dynamical Approximation to Backpropagation in the Brain
Beren Millidge
Alexander Tschantz
A. Seth
Christopher L. Buckley
ODL
81
17
0
11 Sep 2020
Exploration of End-to-end Synthesisers forZero Resource Speech Challenge 2020
Karthik Pandia D.S.
Anusha Prakash
M. M.
H. Murthy
42
4
0
10 Sep 2020
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Kamal Gupta
Susmija Jabbireddy
Ketul Shah
Abhinav Shrivastava
Matthias Zwicker
3DV
43
5
0
07 Sep 2020
Proximity Sensing: Modeling and Understanding Noisy RSSI-BLE Signals and Other Mobile Sensor Data for Digital Contact Tracing
Sheshank Shankar
Rishank Kanaparti
Ayush Chopra
Rohan Sukumaran
Parth Patwa
Myungsun Kang
Abhishek Singh
Kevin P. McPherson
Ramesh Raskar
64
3
0
04 Sep 2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei Chen
Xu Tan
Jian Luan
Tao Qin
Tie-Yan Liu
VLM
104
93
0
03 Sep 2020
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer
Jing-Xuan Zhang
Li-Juan Liu
Yan-Nian Chen
Ya-Jun Hu
Yuan Jiang
Zhenhua Ling
Lirong Dai
57
17
0
03 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
164
795
0
02 Sep 2020
LAVARNET: Neural Network Modeling of Causal Variable Relationships for Multivariate Time Series Forecasting
C. Koutlis
Symeon Papadopoulos
Emmanouil Schinas
Y. Kompatsiaris
CML
AI4TS
33
16
0
02 Sep 2020
Hierarchical Timbre-Painting and Articulation Generation
Michael Michelashvili
Lior Wolf
86
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
150
223
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
88
211
0
28 Aug 2020
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks
J. Nistal
Stefan Lattner
G. Richard
GAN
86
56
0
27 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
99
20
0
27 Aug 2020
DeepVOX: Discovering Features from Raw Audio for Speaker Recognition in Non-ideal Audio Signals
Anurag Chowdhury
Arun Ross
34
2
0
26 Aug 2020
Generating Handwriting via Decoupled Style Descriptors
Atsunobu Kotani
Stefanie Tellex
James Tompkin
72
25
0
26 Aug 2020
ANGUS: Real-time manipulation of vocal roughness for emotional speech transformations
M. Liuni
Luc Ardaillon
L. Bonal
Lou Seropian
J. Aucouturier
8
4
0
25 Aug 2020
Medley2K: A Dataset of Medley Transitions
Lukas Faber
Sandro Luck
Damian Pascual
Andreas Roth
Gino Brunner
Roger Wattenhofer
MedIm
53
0
0
25 Aug 2020
Using Deep Networks for Scientific Discovery in Physiological Signals
Tom Beer
Bar Eini-Porat
Sebastian Goodfellow
Danny Eytan
Uri Shalit
59
5
0
25 Aug 2020
Dynamic Future Net: Diversified Human Motion Generation
Wenheng Chen
He Wang
Yi Yuan
Tianjia Shao
Kun Zhou
3DH
98
23
0
25 Aug 2020
ATM Cash demand forecasting in an Indian Bank with chaos and deep learning
Sarveswararao Vangala
V. Ravi
BDL
55
21
0
24 Aug 2020
RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns
A. Mondal
Prathosh A.P.
VGen
20
3
0
21 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
121
1
0
20 Aug 2020
Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning
Noé Tits
Kevin El Haddad
Thierry Dutoit
31
14
0
20 Aug 2020
Learning to Generate Diverse Dance Motions with Transformer
Jiaman Li
Yihang Yin
Hang Chu
Yi Zhou
Tingwu Wang
Sanja Fidler
Hao Li
86
125
0
18 Aug 2020
Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks
Michal Romaniuk
Piotr Masztalski
K. Piaskowski
M. Matuszewski
25
5
0
17 Aug 2020
POP909: A Pop-song Dataset for Music Arrangement Generation
Ziyu Wang
Kai Chen
Junyan Jiang
Yiyi Zhang
Maoran Xu
Shuqi Dai
Xianbin Gu
Gus Xia
76
139
0
17 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSL
DRL
54
15
0
16 Aug 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
111
11
0
16 Aug 2020
Previous
1
2
3
...
36
37
38
...
60
61
62
Next