ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio
v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Inference-optimized AI and high performance computing for gravitational
  wave detection at scale
Inference-optimized AI and high performance computing for gravitational wave detection at scale
Pranshu Chaturvedi
Asad Khan
Minyang Tian
Eliu A. Huerta
Huihuo Zheng
72
28
0
26 Jan 2022
Invertible Voice Conversion
Invertible Voice Conversion
Zexin Cai
Ming Li
BDL
71
1
0
26 Jan 2022
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention
Artem Gorodetskii
Ivan Ozhiganov
117
2
0
25 Jan 2022
Improving Adversarial Waveform Generation based Singing Voice Conversion
  with Harmonic Signals
Improving Adversarial Waveform Generation based Singing Voice Conversion with Harmonic Signals
Haohan Guo
Zhiping Zhou
Fanbo Meng
Kai-Chun Liu
100
16
0
25 Jan 2022
Text and Code Embeddings by Contrastive Pre-Training
Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
...
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSLAI4TS
401
446
0
24 Jan 2022
Fast Transient Stability Prediction Using Grid-informed Temporal and
  Topological Embedding Deep Neural Network
Fast Transient Stability Prediction Using Grid-informed Temporal and Topological Embedding Deep Neural Network
Peiyuan Sun
L. Huo
Siyuan Liang
Xin Chen
64
7
0
23 Jan 2022
HiSTGNN: Hierarchical Spatio-temporal Graph Neural Networks for Weather Forecasting
Minbo Ma
Peng Xie
Fei Teng
Tian-Jie Li
Bin Wang
Shenggong Ji
Junbo Zhang
AI4TS
55
9
0
22 Jan 2022
Online POI Recommendation: Learning Dynamic Geo-Human Interactions in
  Streams
Online POI Recommendation: Learning Dynamic Geo-Human Interactions in Streams
Dongjie Wang
Kunpeng Liu
Hui Xiong
Yanjie Fu
160
8
0
19 Jan 2022
MHTTS: Fast multi-head text-to-speech for spontaneous speech with
  imperfect transcription
MHTTS: Fast multi-head text-to-speech for spontaneous speech with imperfect transcription
Dabiao Ma
Yitong Zhang
Meng Li
Feng Ye
39
1
0
19 Jan 2022
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for
  Singing Voice Synthesis
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Yu Wang
Xinsheng Wang
Pengcheng Zhu
Jie Wu
Hanzhao Li
Heyang Xue
Yongmao Zhang
Lei Xie
Mengxiao Bi
112
103
0
19 Jan 2022
Variational Autoencoder Generative Adversarial Network for Synthetic
  Data Generation in Smart Home
Variational Autoencoder Generative Adversarial Network for Synthetic Data Generation in Smart Home
Mina Razghandi
Hao Zhou
Melike Erol-Kantarci
D. Turgut
45
33
0
19 Jan 2022
Dilated Convolutional Neural Networks for Lightweight Diacritics
  Restoration
Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration
Bálint Csanády
András Lukács
26
0
0
18 Jan 2022
A Practical Guide to Logical Access Voice Presentation Attack Detection
A Practical Guide to Logical Access Voice Presentation Attack Detection
Xin Wang
Junichi Yamagishi
AAML
109
11
0
10 Jan 2022
Audio representations for deep learning in sound synthesis: A review
Audio representations for deep learning in sound synthesis: A review
Anastasia Natsiou
Seán O'Leary
AI4TS
74
18
0
07 Jan 2022
A sinusoidal signal reconstruction method for the inversion of the
  mel-spectrogram
A sinusoidal signal reconstruction method for the inversion of the mel-spectrogram
Anastasia Natsiou
Seán O'Leary
46
3
0
07 Jan 2022
Classification of Long Sequential Data using Circular Dilated
  Convolutional Neural Networks
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Lei Cheng
Ruslan Khalitov
Tong Yu
Zhirong Yang
73
32
0
06 Jan 2022
Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement
  Biometrics
Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement Biometrics
Dillon Lohr
Oleg V. Komogortsev
77
4
0
05 Jan 2022
A Comprehensive Survey on Radio Frequency (RF) Fingerprinting:
  Traditional Approaches, Deep Learning, and Open Challenges
A Comprehensive Survey on Radio Frequency (RF) Fingerprinting: Traditional Approaches, Deep Learning, and Open Challenges
Anu Jagannath
Jithin Jagannath
P. Kumar
66
145
0
03 Jan 2022
Evaluating Deep Music Generation Methods Using Data Augmentation
Evaluating Deep Music Generation Methods Using Data Augmentation
Toby Godwin
Georgios Rizos
Alice Baird
N. A. Futaisi
Vincent Brisse
Bjoern W. Schuller
MGen
34
1
0
31 Dec 2021
InverseMV: Composing Piano Scores with a Convolutional Video-Music
  Transformer
InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Chin-Tung Lin
Mu Yang
ViT
51
1
0
31 Dec 2021
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional
  Vision-Language Generation
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Han Zhang
Weichong Yin
Yewei Fang
Lanxin Li
Boqiang Duan
Zhihua Wu
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
71
59
0
31 Dec 2021
BP-Net: Cuff-less, Calibration-free, and Non-invasive Blood Pressure
  Estimation via a Generic Deep Convolutional Architecture
BP-Net: Cuff-less, Calibration-free, and Non-invasive Blood Pressure Estimation via a Generic Deep Convolutional Architecture
Soheil Zabihi
E. Rahimian
Fatemeh Marefat
A. Asif
P. Mohseni
Arash Mohammadi
OOD
20
2
0
31 Dec 2021
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal
  Derivatives
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Hideyuki Tachibana
Mocho Go
Muneyoshi Inahara
Yotaro Katayama
Yotaro Watanabe
DiffM
78
3
0
26 Dec 2021
Latent Space Simulation for Carbon Capture Design Optimization
Latent Space Simulation for Carbon Capture Design Optimization
Brian Bartoldson
Rui Wang
Yu-Hang Fu
David Widemann
Sam Nguyen
J. Bao
Zhijie Xu
Brenda Ng
52
3
0
22 Dec 2021
TFDPM: Attack detection for cyber-physical systems with diffusion
  probabilistic models
TFDPM: Attack detection for cyber-physical systems with diffusion probabilistic models
Tijin Yan
Tong Zhou
Yufeng Zhan
Yuanqing Xia
DiffM
59
8
0
20 Dec 2021
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale
  Corpus
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
Rongjie Huang
Feiyang Chen
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
97
104
0
20 Dec 2021
Soundify: Matching Sound Effects to Video
Soundify: Matching Sound Effects to Video
David Chuan-En Lin
Anastasis Germanidis
Cristobal Valenzuela
Yining Shi
Nikolas Martelaro
79
16
0
17 Dec 2021
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical
  Modeling
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Yusong Wu
Ethan Manilow
Yi Deng
Rigel Swavely
Kyle Kastner
Tim Cooijmans
Aaron Courville
Cheng-Zhi Anna Huang
Jesse Engel
87
45
0
17 Dec 2021
A Comparative Study of Detecting Anomalies in Time Series Data Using
  LSTM and TCN Models
A Comparative Study of Detecting Anomalies in Time Series Data Using LSTM and TCN Models
Saroj Gopali
Faranak Abri
Sima Siami‐Namini
A. Namin
AI4TS
31
13
0
17 Dec 2021
EmotionBox: a music-element-driven emotional music generation system
  using Recurrent Neural Network
EmotionBox: a music-element-driven emotional music generation system using Recurrent Neural Network
Kaitong Zheng
R. Meng
C. Zheng
Xiaodong Li
Jinqiu Sang
JuanJuan Cai
Jie Wang
MGen
58
2
0
16 Dec 2021
Leveraging Image-based Generative Adversarial Networks for Time Series
  Generation
Leveraging Image-based Generative Adversarial Networks for Time Series Generation
Justin Hellermann
Stefan Lessmann
GANAI4TS
72
4
0
15 Dec 2021
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao
Karsten Kreis
Arash Vahdat
DiffM
131
562
0
15 Dec 2021
Scale-Aware Neural Architecture Search for Multivariate Time Series
  Forecasting
Scale-Aware Neural Architecture Search for Multivariate Time Series Forecasting
Donghui Chen
Ling-Hao Chen
Zongjiang Shang
Youdong Zhang
Bo Wen
Chenghu Yang
AI4TS
73
7
0
14 Dec 2021
AI and extreme scale computing to learn and infer the physics of higher
  order gravitational wave modes of quasi-circular, spinning, non-precessing
  binary black hole mergers
AI and extreme scale computing to learn and infer the physics of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers
Asad Khan
E. A. H. abd
Prayush Kumar
74
5
0
13 Dec 2021
Computational bioacoustics with deep learning: a review and roadmap
Computational bioacoustics with deep learning: a review and roadmap
D. Stowell
98
259
0
13 Dec 2021
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling
  Regularization
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization
P. Liotet
Francesco Vidaich
Alberto Maria Metelli
Marcello Restelli
OffRL
71
8
0
13 Dec 2021
Causal Knowledge Guided Societal Event Forecasting
Causal Knowledge Guided Societal Event Forecasting
Songgaojun Deng
Huzefa Rangwala
Yue Ning
AI4TS
61
2
0
10 Dec 2021
Neural Multi-Quantile Forecasting for Optimal Inventory Management
Neural Multi-Quantile Forecasting for Optimal Inventory Management
Federico Garza Ramírez
32
1
0
10 Dec 2021
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction
  and Lip Reading
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
79
23
0
09 Dec 2021
Forecasting Brain Activity Based on Models of Spatio-Temporal Brain
  Dynamics: A Comparison of Graph Neural Network Architectures
Forecasting Brain Activity Based on Models of Spatio-Temporal Brain Dynamics: A Comparison of Graph Neural Network Architectures
S. Wein
Alina Schüller
A. Tomé
W. Malloni
M. Greenlee
E. Lang
AI4CE
86
15
0
08 Dec 2021
Periodic Residual Learning for Crowd Flow Forecasting
Periodic Residual Learning for Crowd Flow Forecasting
Chengxin Wang
Yuxuan Liang
Gary S. H. Tan
AI4TS
47
12
0
08 Dec 2021
Dilated convolution with learnable spacings
Dilated convolution with learnable spacings
Ismail Khalfaoui-Hassani
Thomas Pellegrini
T. Masquelier
125
32
0
07 Dec 2021
VocBench: A Neural Vocoder Benchmark for Speech Synthesis
VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Ehab A. AlBadawy
Andrew Gibiansky
Qing He
Jilong Wu
Ming-Ching Chang
Siwei Lyu
61
12
0
06 Dec 2021
Parameter Efficient Deep Probabilistic Forecasting
Parameter Efficient Deep Probabilistic Forecasting
O. Sprangers
Sebastian Schelter
Maarten de Rijke
BDLAI4TS
118
24
0
06 Dec 2021
Dynamic Graph Learning-Neural Network for Multivariate Time Series
  Modeling
Dynamic Graph Learning-Neural Network for Multivariate Time Series Modeling
Zhuoling Li
Gaowei Zhang
Lingyu Xu
Jie Yu
AI4TS
42
2
0
06 Dec 2021
ES-dRNN: A Hybrid Exponential Smoothing and Dilated Recurrent Neural
  Network Model for Short-Term Load Forecasting
ES-dRNN: A Hybrid Exponential Smoothing and Dilated Recurrent Neural Network Model for Short-Term Load Forecasting
Slawek Smyl
Grzegorz Dudek
Paweł Pełka
AI4TS
61
33
0
05 Dec 2021
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice
  Conversion for everyone
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
249
415
0
04 Dec 2021
My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging
  Side-Channel Attack
My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack
Matthias Gazzari
Annemarie Mattmann
Max Maass
M. Hollick
47
5
0
04 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
69
22
0
04 Dec 2021
Deep Efficient Continuous Manifold Learning for Time Series Modeling
Deep Efficient Continuous Manifold Learning for Time Series Modeling
Seungwoo Jeong
Wonjun Ko
A. Mulyadi
Heung-Il Suk
AI4TS
92
7
0
03 Dec 2021
Previous
123...242526...606162
Next