1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages
Sudhanshu Srivastava
Ishika Gupta
Anusha Prakash
Jom Kuriakose
H. Murthy
VLM
70
1
0
13 Feb 2023
Vector Quantized Wasserstein Auto-Encoder
Tung-Long Vuong
Trung Le
He Zhao
Chuanxia Zheng
Mehrtash Harandi
Jianfei Cai
Dinh Q. Phung
DRL
68
20
0
12 Feb 2023
SLOTH: Structured Learning and Task-based Optimization for Time Series Forecasting on Hierarchies
Fang-le Zhou
Chenle Pan
Lintao Ma
Yu Liu
Shiyu Wang
...
Xu Hu
Yun Hu
Yang Zheng
Lei Lei
Yun Hu
84
3
0
11 Feb 2023
Pruning Deep Neural Networks from a Sparsity Perspective
Enmao Diao
G. Wang
Jiawei Zhan
Yuhong Yang
Jie Ding
Vahid Tarokh
82
32
0
11 Feb 2023
DNArch: Learning Convolutional Neural Architectures by Backpropagation
David W. Romero
Neil Zeghidour
AI4CE
59
4
0
10 Feb 2023
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemysław Spurek
Jacek Tabor
Tomasz Trzciński
118
11
0
09 Feb 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
Peng Fei Zhu
Chao Pang
Yekun Chai
Lei Li
Shuohuan Wang
Yu Sun
Hao Tian
Hua Wu
DiffM
56
20
0
09 Feb 2023
Short-Term Memory Convolutions
Grzegorz Stefański
Krzysztof Arendt
P. Daniluk
Bartlomiej Jasik
Artur Szumaczuk
51
4
0
08 Feb 2023
Machine Learning for Synthetic Data Generation: A Review
Ying-Cheng Lu
Minjie Shen
Huazheng Wang
Xiao Wang
Capucine Van Rechem
Tianfan Fu
Wenqi Wei
SyDa
225
150
0
08 Feb 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
155
43
0
04 Feb 2023
Multivariate Time Series Anomaly Detection via Dynamic Graph Forecasting
Katrina Chen
M. Feng
T. Wirjanto
AI4TS
22
7
0
04 Feb 2023
Time Series Forecasting via Semi-Asymmetric Convolutional Architecture with Global Atrous Sliding Window
Yuanpeng He
AI4TS
73
0
0
31 Jan 2023
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion Models
Haomin Wen
Youfang Lin
Yutong Xia
Huaiyu Wan
Qingsong Wen
Roger Zimmermann
Yuxuan Liang
DiffM
135
94
0
31 Jan 2023
ArchiSound: Audio Generation with Diffusion
Flavio Schneider
72
25
0
30 Jan 2023
Deep networks for system identification: a Survey
G. Pillonetto
Aleksandr Aravkin
Daniel Gedon
L. Ljung
Antônio H. Ribeiro
Thomas B. Schon
OOD
109
45
0
30 Jan 2023
SingSong: Generating musical accompaniments from singing
Chris Donahue
Antoine Caillon
Adam Roberts
Ethan Manilow
P. Esling
...
Mauro Verzetti
Ian Simon
Olivier Pietquin
Neil Zeghidour
Jesse Engel
110
55
0
30 Jan 2023
Do We Really Need Graph Neural Networks for Traffic Forecasting?
Xu Liu
Yuxuan Liang
Chao Huang
Hengchang Hu
Yushi Cao
Bryan Hooi
Roger Zimmermann
AI4TS
105
22
0
30 Jan 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu
Zehua Chen
Yi Yuan
Xinhao Mei
Xubo Liu
Danilo Mandic
Wenwu Wang
Mark D. Plumbley
DiffM
180
510
0
29 Jan 2023
On granularity of prosodic representations in expressive text-to-speech
Mikolaj Babianski
Kamil Pokora
Raahil Shah
Rafał Sienkiewicz
Daniel Korzekwa
V. Klimkov
66
6
0
26 Jan 2023
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
154
451
0
26 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
115
2
0
26 Jan 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
77
16
0
25 Jan 2023
Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN
David M. Knigge
David W. Romero
Albert Gu
E. Gavves
Erik J. Bekkers
Jakub M. Tomczak
Mark Hoogendoorn
Jan-Jakob Sonke
3DV
91
22
0
25 Jan 2023
FInC Flow: Fast and Invertible $k \times k$ Convolutions for Normalizing Flows
Aditya Kallappa
Sandeep Nagar
Girish Varma
73
2
0
23 Jan 2023
Dance2MIDI: Dance-driven multi-instruments music generation
Bo Han
Yuheng Li
Yixuan Shen
Yi Ren
Feilin Han
132
5
0
22 Jan 2023
Regeneration Learning: A Learning Paradigm for Data Generation
Xu Tan
Tao Qin
Jiang Bian
Tie-Yan Liu
Yoshua Bengio
GAN
64
15
0
21 Jan 2023
Novel-View Acoustic Synthesis
Changan Chen
Alexander Richard
Roman Shapovalov
V. Ithapu
Natalia Neverova
Kristen Grauman
Andrea Vedaldi
76
38
0
20 Jan 2023
An investigation of the reconstruction capacity of stacked convolutional autoencoders for log-mel-spectrograms
Anastasia Natsiou
Luca Longo
Seán O'Leary
18
0
0
18 Jan 2023
A Transformer-based Diffusion Probabilistic Model for Heart Rate and Blood Pressure Forecasting in Intensive Care Unit
Ping Chang
Huayu Li
S. Quan
Shuyang Lu
Shu-Fen Wung
Janet Roveda
Ao Li
DiffM
131
20
0
16 Jan 2023
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey
Michael Perez
Corey Toler-Franklin
MedIm
75
15
0
15 Jan 2023
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Simbarashe Nyatsanga
Taras Kucherenko
Chaitanya Ahuja
G. Henter
Michael Neff
SLR
114
94
0
13 Jan 2023
Spectral Cross-Domain Neural Network with Soft-adaptive Threshold Spectral Enhancement
Che Liu
Sibo Cheng
Weiping Ding
Rossella Arcucci
68
11
0
10 Jan 2023
Introducing Model Inversion Attacks on Automatic Speaker Recognition
Karla Pizzi
Franziska Boenisch
U. Sahin
Konstantin Böttinger
117
3
0
09 Jan 2023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
193
727
0
05 Jan 2023
Autonomous Drone Racing: A Survey
D. Hanover
Antonio Loquercio
L. Bauersfeld
Angel Romero
Robert Pěnička
Yunlong Song
Giovanni Cioffi
Elia Kaufmann
Davide Scaramuzza
156
66
0
04 Jan 2023
An end-to-end multi-scale network for action prediction in videos
Xiaofan Liu
Jianqin Yin
Yuanxi Sun
Zhicheng Zhang
Jin Tang
61
0
0
31 Dec 2022
Blind Restoration of Real-World Audio by 1D Operational GANs
T. Ince
S. Kiranyaz
Ozer Can Devecioglu
Muhammad Salman Khan
Muhammad Chowdhury
Moncef Gabbouj
61
4
0
30 Dec 2022
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Ze Chen
Yihan Wu
Yichong Leng
Jiawei Chen
Haohe Liu
...
Ke Wang
Lei He
Sheng Zhao
Jiang Bian
Danilo Mandic
DiffM
108
23
0
30 Dec 2022
Voice conversion with limited data and limitless data augmentations
Olga Slizovskaia
Jordi Janer
Pritish Chandna
Oscar Mayor
64
1
0
27 Dec 2022
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
94
5
0
25 Dec 2022
Deep Latent State Space Models for Time-Series Generation
Linqi Zhou
Michael Poli
Winnie Xu
Stefano Massaroli
Stefano Ermon
BDL
AI4TS
88
38
0
24 Dec 2022
A Mathematical Framework for Learning Probability Distributions
Hongkang Yang
113
7
0
22 Dec 2022
End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Ayman Mansour
Wafaa F. Mukhtar
39
1
0
21 Dec 2022
Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
Peter J. Ramadge
81
42
0
20 Dec 2022
Contextually Enhanced ES-dRNN with Dynamic Attention for Short-Term Load Forecasting
Slawek Smyl
Grzegorz Dudek
Paweł Pełka
AI4TS
74
14
0
18 Dec 2022
Leveraging Wastewater Monitoring for COVID-19 Forecasting in the US: a Deep Learning study
Mehrdad Fazli
Heman Shakeri
44
1
0
17 Dec 2022
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Yusuke Yasuda
Tomoki Toda
DiffM
79
8
0
16 Dec 2022
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
102
6
0
16 Dec 2022
RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis
Shinhyeok Oh
HyeongRae Noh
Yoonseok Hong
Insoo Oh
77
0
0
15 Dec 2022
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning
M. Ibrahim
Ammar Fayad
64
1
0
14 Dec 2022