Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,046 papers shown
Title
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
43
23
0
09 Dec 2021
Forecasting Brain Activity Based on Models of Spatio-Temporal Brain Dynamics: A Comparison of Graph Neural Network Architectures
S. Wein
Alina Schüller
A. Tomé
W. Malloni
M. Greenlee
E. Lang
AI4CE
49
14
0
08 Dec 2021
Periodic Residual Learning for Crowd Flow Forecasting
Chengxin Wang
Yuxuan Liang
Gary S. H. Tan
AI4TS
25
12
0
08 Dec 2021
Dilated convolution with learnable spacings
Ismail Khalfaoui-Hassani
Thomas Pellegrini
T. Masquelier
33
32
0
07 Dec 2021
VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Ehab A. AlBadawy
Andrew Gibiansky
Qing He
Jilong Wu
Ming-Ching Chang
Siwei Lyu
32
12
0
06 Dec 2021
Parameter Efficient Deep Probabilistic Forecasting
O. Sprangers
Sebastian Schelter
Maarten de Rijke
BDL
AI4TS
79
21
0
06 Dec 2021
Dynamic Graph Learning-Neural Network for Multivariate Time Series Modeling
Zhuoling Li
Gaowei Zhang
Lingyu Xu
Jie Yu
AI4TS
24
2
0
06 Dec 2021
ES-dRNN: A Hybrid Exponential Smoothing and Dilated Recurrent Neural Network Model for Short-Term Load Forecasting
Slawek Smyl
Grzegorz Dudek
Paweł Pełka
AI4TS
27
27
0
05 Dec 2021
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
196
387
0
04 Dec 2021
My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack
Matthias Gazzari
Annemarie Mattmann
Max Maass
M. Hollick
24
5
0
04 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
36
21
0
04 Dec 2021
Deep Efficient Continuous Manifold Learning for Time Series Modeling
Seungwoo Jeong
Wonjun Ko
A. Mulyadi
Heung-Il Suk
AI4TS
36
7
0
03 Dec 2021
A Daily Tourism Demand Prediction Framework Based on Multi-head Attention CNN: The Case of The Foreign Entrant in South Korea
Dong-Keon Kim
S. K. Shyn
Donghee Kim
Seungwoo Jang
Kwangsu Kim
AI4TS
24
6
0
01 Dec 2021
Improving Deep Learning Interpretability by Saliency Guided Training
Aya Abdelsalam Ismail
H. C. Bravo
Soheil Feizi
FAtt
33
81
0
29 Nov 2021
How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey
Zahra Khanjani
Gabrielle Watson
V. P Janeja
30
25
0
28 Nov 2021
When Creators Meet the Metaverse: A Survey on Computational Arts
Lik-Hang Lee
Zijun Lin
Rui Hu
Zhengya Gong
Abhishek Kumar
Tangyao Li
Sijia Li
Pan Hui
45
88
0
26 Nov 2021
A-Muze-Net: Music Generation by Composing the Harmony based on the Generated Melody
Orly Goren
Eliya Nachmani
Lior Wolf
50
4
0
25 Nov 2021
Spatio-Temporal Joint Graph Convolutional Networks for Traffic Forecasting
Chuanpan Zheng
Xiaoliang Fan
Shirui Pan
Haibing Jin
Zhaopeng Peng
Zonghan Wu
Cheng-Yu Wang
Philip S. Yu
GNN
AI4TS
22
50
0
25 Nov 2021
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
Jiashuo Yu
Ying Cheng
Ruiwei Zhao
Rui Feng
Yuejie Zhang
37
53
0
24 Nov 2021
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Heeseung Kim
Sungwon Kim
Sungroh Yoon
DiffM
BDL
24
107
0
23 Nov 2021
Neural Fields in Visual Computing and Beyond
Yiheng Xie
Towaki Takikawa
Shunsuke Saito
Or Litany
Shiqin Yan
Numair Khan
Federico Tombari
James Tompkin
Vincent Sitzmann
Srinath Sridhar
3DH
85
620
0
22 Nov 2021
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Zheng-Hua Tan
John H. L. Hansen
Jesper Jensen
32
103
0
20 Nov 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Michael Hassid
Michelle Tadmor Ramanovich
Brendan Shillingford
Miaosen Wang
Ye Jia
Tal Remez
DiffM
30
17
0
19 Nov 2021
Differentiable Wavetable Synthesis
Siyuan Shan
Lamtharn Hantrakul
Jitong Chen
Matt Avent
David Trevelyan
47
20
0
19 Nov 2021
A transformer-based model for default prediction in mid-cap corporate markets
Kamesh Korangi
Christophe Mues
Cristián Bravo
AI4TS
33
27
0
18 Nov 2021
Causal Forecasting:Generalization Bounds for Autoregressive Models
L. C. Vankadara
P. M. Faller
Michaela Hardt
Lenon Minorics
Debarghya Ghoshdastidar
Dominik Janzing
OOD
38
6
0
18 Nov 2021
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Nikolaos Ellinas
G. Vamvoukakis
K. Markopoulos
Aimilios Chalamandaris
Georgia Maniati
Panos Kakoulidis
S. Raptis
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
32
36
0
17 Nov 2021
Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach
Maximilian Würfel
Qiwei Han
Maximilian Kaiser
AI4TS
33
8
0
16 Nov 2021
Towards Generating Real-World Time Series Data
Hengzhi Pei
Kan Ren
Yuqing Yang
Chang-Shu Liu
Tao Qin
Dongsheng Li
AI4TS
41
34
0
16 Nov 2021
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di
Jiang
Sihan Liu
Zhaokai Wang
Leyan Zhu
Zexin He
Hongming Liu
Shuicheng Yan
32
90
0
16 Nov 2021
Property Inference Attacks Against GANs
Junhao Zhou
Yufei Chen
Chao Shen
Yang Zhang
AAML
MIACV
53
52
0
15 Nov 2021
Skillful Twelve Hour Precipitation Forecasts using Large Context Neural Networks
L. Espeholt
Shreya Agrawal
C. Sønderby
M. Kumar
Jonathan Heek
Carla Bromberg
Cenk Gazen
Jason Hickey
Aaron Bell
Nal Kalchbrenner
AI4Cl
30
48
0
14 Nov 2021
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Damien Ronssin
Milos Cernak
33
10
0
12 Nov 2021
Learning Signal-Agnostic Manifolds of Neural Fields
Yilun Du
Katherine M. Collins
J. Tenenbaum
Vincent Sitzmann
MedIm
34
47
0
11 Nov 2021
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders
Aaron Valero Puche
Sukhan Lee
26
1
0
09 Nov 2021
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis
Antoine Caillon
P. Esling
DRL
27
110
0
09 Nov 2021
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
53
0
0
09 Nov 2021
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
19
6
0
08 Nov 2021
Speaker Generation
Daisy Stanton
Matt Shannon
Soroosh Mariooryad
RJ Skerry-Ryan
Eric Battenberg
Tom Bagby
David Kao
33
28
0
07 Nov 2021
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech
Sung-Feng Huang
Chyi-Jiunn Lin
Da-Rong Liu
Yi-Chen Chen
Hung-yi Lee
60
56
0
07 Nov 2021
Emotional Prosody Control for Speech Generation
S. Sivaprasad
Saiteja Kosgi
Vineet Gandhi
18
17
0
07 Nov 2021
Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Pengfei Zhang
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jianru Xue
Nanning Zheng
55
6
0
07 Nov 2021
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data
Priya Shukla
V. Kushwaha
G. C. Nandi
34
4
0
06 Nov 2021
A Bayesian generative neural network framework for epidemic inference problems
I. Biazzo
Alfredo Braunstein
Luca DallÁsta
Fabio Mazza
41
16
0
05 Nov 2021
Generating Diverse Realistic Laughter for Interactive Art
Mehdi Park Eric Paquette Étienne Gidel Gauthier Mathewso Afsar
Eric Park
Étienne Paquette
Gauthier Gidel
Kory W. Mathewson
Eilif B. Muller
28
7
0
04 Nov 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schonherr
DiffM
140
125
0
04 Nov 2021
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Benjamin van Niekerk
M. Carbonneau
Julian Zaïdi
Matthew Baas
Hugo Seuté
Herman Kamper
DRL
55
114
0
03 Nov 2021
Weight, Block or Unit? Exploring Sparsity Tradeoffs for Speech Enhancement on Tiny Neural Accelerators
Marko Stamenovic
Nils L. Westhausen
Li-Chia Yang
Carl R. Jensen
Alex Pawlicki
38
10
0
03 Nov 2021
Trajectory Prediction with Graph-based Dual-scale Context Fusion
Lu Zhang
Peiliang Li
Jing Chen
Shaojie Shen
50
28
0
02 Nov 2021
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting
Philipp Weidel
Sadique Sheik
12
14
0
02 Nov 2021
Previous
1
2
3
...
24
25
26
...
59
60
61
Next