Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Simian Luo
Chuanhao Yan
Chenxu Hu
Hang Zhao
DiffM
105
83
0
29 Jun 2023
Physics-inspired spatiotemporal-graph AI ensemble for the detection of higher order wave mode signals of spinning binary black hole mergers
Minyang Tian
Eliu A. Huerta
Huihuo Zheng
Prayush Kumar
52
5
0
27 Jun 2023
Machine learning in solar physics
A. Asensio Ramos
Mark C. M. Cheung
I. Chifu
Ricardo Gafeira
AI4CE
PINN
62
35
0
27 Jun 2023
Intensity-free Convolutional Temporal Point Process: Incorporating Local and Global Event Contexts
Wangtao Zhou
Zhao Kang
Ling Tian
Yimu Su
102
11
0
24 Jun 2023
MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning
Mohammad Reza Hasanabadi
26
3
0
22 Jun 2023
Exploring the Landscape of Ubiquitous In-home Health Monitoring: A Comprehensive Survey
Farhad Pourpanah
Ali Etemad
100
5
0
22 Jun 2023
Comparing Deep Learning Models for the Task of Volatility Prediction Using Multivariate Data
Wenbo Ge
Pooia Lalbakhsh
Leigh Isai
Artem Lenskiy
Hanna Suominen
OOD
34
3
0
20 Jun 2023
HDVIO: Improving Localization and Disturbance Estimation with Hybrid Dynamics VIO
Giovanni Cioffi
L. Bauersfeld
Davide Scaramuzza
104
15
0
20 Jun 2023
Understanding Deep Generative Models with Generalized Empirical Likelihoods
Suman V. Ravuri
Mélanie Rey
S. Mohamed
M. Deisenroth
VLM
72
5
0
16 Jun 2023
Power-law Dynamic arising from machine learning
Wei Chen
Weitao Du
Zhi-Ming Ma
Qi Meng
35
0
0
16 Jun 2023
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Shivam Mehta
Siyang Wang
Simon Alexanderson
Jonas Beskow
Éva Székely
G. Henter
DiffM
103
14
0
15 Jun 2023
Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Wenzhe Liu
Yupeng Shi
Jun Chen
Wei Rao
Shulin He
Andong Li
Yannan Wang
Zhiyong Wu
54
6
0
14 Jun 2023
Data Augmentation for Seizure Prediction with Generative Diffusion Model
Kai Shu
Yuchang Zhao
Le Wu
Aiping Liu
Ruobing Qian
Xun Chen
DiffM
73
13
0
14 Jun 2023
HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models
Ji-Sang Hwang
Sang-Hoon Lee
Seong-Whan Lee
DiffM
60
9
0
12 Jun 2023
Efficient Skip Connections Realization for Secure Inference on Encrypted Data
Nir Drucker
Itamar Zimerman
41
1
0
11 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
126
338
0
11 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
115
30
0
09 Jun 2023
Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion
Hao Liu
Tao Wang
Jie Cao
Ran He
J. Tao
DiffM
74
4
0
09 Jun 2023
Does Long-Term Series Forecasting Need Complex Attention and Extra Long Inputs?
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Xiaoyan Ma
Dongyang Li
Minggao Zhang
AI4TS
41
6
0
08 Jun 2023
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Ziyue Jiang
Yi Ren
Zhe Ye
Jinglin Liu
Chen Zhang
...
Rongjie Huang
Chunfeng Wang
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
105
80
0
06 Jun 2023
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Yochai Yemini
Aviv Shamsian
Lior Bracha
Sharon Gannot
Ethan Fetaya
DiffM
116
15
0
05 Jun 2023
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis
Dengfeng Ke
Yayue Deng
Yukang Jia
Jinlong Xue
Qi Luo
Ya Li
Jianqing Sun
Jiaen Liang
Binghuai Lin
39
0
0
05 Jun 2023
Non-parametric Probabilistic Time Series Forecasting via Innovations Representation
Xinyi Wang
Mei-Yu Lee
Qing Zhao
Lang Tong
AI4TS
94
3
0
05 Jun 2023
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis
N. Prabhu
N. Lehmann-Willenbrock
Timo Gerkmann
76
3
0
02 Jun 2023
Towards Robust FastSpeech 2 by Modelling Residual Multimodality
Fabian Kögel
Bac Nguyen
Fabien Cardinaux
55
2
0
02 Jun 2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim
Soo-Whan Chung
Hyewon Han
Youna Ji
Hong-Goo Kang
71
7
0
02 Jun 2023
Hierarchical Attention Encoder Decoder
Asier Mujika
BDL
57
3
0
01 Jun 2023
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Hubert Siuzdak
148
104
0
01 Jun 2023
Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Yuting Yang
Yuke Li
Binbin Du
AI4TS
70
0
0
01 Jun 2023
Challenges and Remedies to Privacy and Security in AIGC: Exploring the Potential of Privacy Computing, Blockchain, and Beyond
Chuan Chen
Zhenpeng Wu
Yan-Hao Lai
Wen-chao Ou
Tianchi Liao
Zibin Zheng
136
36
0
01 Jun 2023
Learning Gaussian Mixture Representations for Tensor Time Series Forecasting
Jiewen Deng
Jinliang Deng
Renhe Jiang
Xuan Song
AI4TS
99
5
0
01 Jun 2023
HiGen: Hierarchical Graph Generative Networks
Mahdi Karami
77
4
0
30 May 2023
Client: Cross-variable Linear Integrated Enhanced Transformer for Multivariate Long-Term Time Series Forecasting
Jiaxin Gao
Wenbo Hu
Yuntian Chen
AI4TS
81
13
0
30 May 2023
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
M. Bacchiani
Yu Zhang
Wei Han
Ankur Bapna
114
80
0
30 May 2023
Graph Neural Processes for Spatio-Temporal Extrapolation
Junfeng Hu
Yuxuan Liang
Zhencheng Fan
Hongyang Chen
Yu Zheng
Roger Zimmermann
BDL
76
12
0
30 May 2023
Forward and Inverse Approximation Theory for Linear Temporal Convolutional Networks
Hao Jiang
Qianxiao Li
AI4TS
87
0
0
29 May 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo
Tianyang Hu
Shifeng Zhang
Jiacheng Sun
Zhenguo Li
Zhihua Zhang
124
138
0
29 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA
MedIm
105
186
0
26 May 2023
Generative Adversarial Reduced Order Modelling
Dario Coscia
N. Demo
G. Rozza
GAN
AI4CE
160
7
0
25 May 2023
Market Making with Deep Reinforcement Learning from Limit Order Books
Hongli Guo
Jianwu Lin
Fanlin Huang
OffRL
152
2
0
25 May 2023
TLNets: Transformation Learning Networks for long-range time-series prediction
Wen Wang
Yang Liu
Haoqin Sun
AI4TS
72
3
0
25 May 2023
Efficient Neural Music Generation
Max W. Y. Lam
Qiao Tian
Tang-Chun Li
Zongyu Yin
Siyuan Feng
...
Mingbo Ma
Xuchen Song
Jitong Chen
Yuping Wang
Yuxuan Wang
DiffM
MGen
95
56
0
25 May 2023
Sound Design Strategies for Latent Audio Space Explorations Using Deep Learning Architectures
Kivancc Tatar
Kelsey Cotton
D. Bisig
DiffM
53
2
0
24 May 2023
Interpretation of Time-Series Deep Models: A Survey
Ziqi Zhao
Yucheng Shi
Shushan Wu
Fan Yang
Wenzhan Song
Ninghao Liu
AI4TS
95
7
0
23 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
58
2
0
23 May 2023
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
Ziyue Jiang
Qiang Yang
Jia-li Zuo
Zhe Ye
Rongjie Huang
Yixiang Ren
Zhou Zhao
DiffM
99
17
0
23 May 2023
Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography
Felix Denzinger
M. Wels
O. Taubmann
Florian Kordon
Fabian Wagner
...
F. André
S. Buss
Johannes Görich
M. Sühling
Andreas Maier
43
0
0
22 May 2023
RWKV: Reinventing RNNs for the Transformer Era
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
240
614
0
22 May 2023
Towards generalizing deep-audio fake detection networks
Konstantin Gasenzer
Moritz Wolter
75
4
0
22 May 2023
Forecasting Irregularly Sampled Time Series using Graphs
Vijaya Krishna Yalavarthi
Kiran Madusudanan
Randolf Scholz
Nourhan Ahmed
Johannes Burchert
Shayan Jawed
Stefan Born
Lars Schmidt-Thieme
AI4TS
44
2
0
22 May 2023
Previous
1
2
3
...
12
13
14
...
60
61
62
Next