WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
arXiv: 1609.03499

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders
Yang Ai
Zhenhua Ling
65
8
0
16 Apr 2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
Kaizhi Qian
Zeyu Jin
M. Hasegawa-Johnson
G. J. Mysore
82
107
0
15 Apr 2020
Longformer: The Long-Document Transformer
Iz Beltagy
Matthew E. Peters
Arman Cohan
RALM, VLM
244
4,111
0
10 Apr 2020
Scalable Multilingual Frontend for TTS
Alistair Conkie
A. Finch
37
13
0
10 Apr 2020
Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation
Sajad Norouzi
David J. Fleet
Mohammad Norouzi
VLM, DRL
63
3
0
09 Apr 2020
Improving Expressivity of Graph Neural Networks
Stanislaw J. Purgal
17
4
0
08 Apr 2020
State of the Art on Neural Rendering
A. Tewari
Ohad Fried
Justus Thies
Vincent Sitzmann
Stephen Lombardi
...
Christian Theobalt
Maneesh Agrawala
Eli Shechtman
Dan B. Goldman
Michael Zollhöfer
3DH, 3DV
141
473
0
08 Apr 2020
Training End-to-end Single Image Generators without GANs
Yael Vinker
Nir Zabari
Yedid Hoshen
34
1
0
07 Apr 2020
SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
R. Rehr
Timo Gerkmann
40
15
0
07 Apr 2020
From Artificial Neural Networks to Deep Learning for Music Generation -- History, Concepts and Trends
Jean-Pierre Briot
MGen
75
78
0
07 Apr 2020
Residual Shuffle-Exchange Networks for Fast Processing of Long Sequences
Andis Draguns
Emīls Ozoliņš
A. Sostaks
Matiss Apinis
Kārlis Freivalds
56
8
0
06 Apr 2020
Forecast Network-Wide Traffic States for Multiple Steps Ahead: A Deep Learning Approach Considering Dynamic Non-Local Spatial Correlation and Non-Stationary Temporal Dependency
Xinglei Wang
Xuefeng Guan
Jun Cao
N. Zhang
Huayi Wu
GNN, AI4TS
76
42
0
06 Apr 2020
Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System
Gwenaelle Cunha Sergio
Minho Lee
25
8
0
05 Apr 2020
SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation
Mohsen Fayyaz
Juergen Gall
ViT
75
71
0
31 Mar 2020
VaPar Synth -- A Variational Parametric Model for Audio Synthesis
Krishna Subramani
Preeti Rao
Alexandre D'Hooge
35
9
0
30 Mar 2020
MCFlow: Monte Carlo Flow Models for Data Imputation
Trevor W. Richardson
Wencheng Wu
Lei Lin
Beilei Xu
Edgar A. Bernal
OOD
82
48
0
27 Mar 2020
Spatiotemporal Adaptive Neural Network for Long-term Forecasting of Financial Time Series
Philippe Chatigny
Jean-Marc Patenaude
Shengrui Wang
AI4TS
62
5
0
27 Mar 2020
Learning To Solve Differential Equations Across Initial Conditions
Shehryar Malik
Usman Anwar
Ali Ahmed
Alireza Aghasi
AI4CE
51
8
0
26 Mar 2020
Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders
Wissam A. Jassim
Jan Skoglund
Michael Chinen
Andrew Hines
19
8
0
26 Mar 2020
A Survey of Deep Learning for Scientific Discovery
M. Raghu
Erica Schmidt
OOD, AI4CE
182
123
0
26 Mar 2020
TeCNO: Surgical Phase Recognition with Multi-Stage Temporal Convolutional Networks
Tobias Czempiel
Magdalini Paschali
Matthias Keicher
Walter Simson
H. Feußner
S. T. Kim
Nassir Navab
91
186
0
24 Mar 2020
Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method
Cunhang Fan
J. Tao
B. Liu
Jiangyan Yi
Zhengqi Wen
Xuefei Liu
61
9
0
17 Mar 2020
GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory Prediction
Chengxin Wang
Shaofeng Cai
Gary S. H. Tan
GNN
103
53
0
16 Mar 2020
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
Jan van Gemert
341
237
0
16 Mar 2020
Audio inpainting with generative adversarial network
P. Ebner
Amr Eltelt
GAN
70
24
0
13 Mar 2020
Dynamic Spatiotemporal Graph Neural Network with Tensor Network
Chengcheng Jia
Bo Wu
Xiao-Ping Zhang
AI4TS
88
7
0
12 Mar 2020
Integrating Scientific Knowledge with Machine Learning for Engineering and Environmental Systems
J. Willard
X. Jia
Shaoming Xu
M. Steinbach
Vipin Kumar
AI4CE
158
415
0
10 Mar 2020
Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis
Ting-Yao Hu
A. Shrivastava
Oncel Tuzel
C. Dhir
57
32
0
09 Mar 2020
Adversarial Attacks on Probabilistic Autoregressive Forecasting Models
Raphaël Dang-Nhu
Gagandeep Singh
Pavol Bielik
Martin Vechev
AI4TS, AAML
84
21
0
08 Mar 2020
Online Self-Supervised Learning for Object Picking: Detecting Optimum Grasping Position using a Metric Learning Approach
Kanata Suzuki
Yasuto Yokota
Yuzi Kanazawa
T. Takebayashi
34
15
0
08 Mar 2020
TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation
Wen Wang
Xiaojiang Peng
Yanzhou Su
Yu Qiao
Jian Cheng
AI4TS
82
18
0
07 Mar 2020
Training Deep Energy-Based Models with f-Divergence Minimization
Lantao Yu
Yang Song
Jiaming Song
Stefano Ermon
238
44
0
06 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
134
126
0
06 Mar 2020
A Neural Network Based Framework for Archetypical Sound Synthesis
E. Guizzo
A. Novello
32
0
0
06 Mar 2020
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning
Yifan Zhang
P. Zhao
Qingyao Wu
Bin Li
Junzhou Huang
Mingkui Tan
OOD
151
97
0
06 Mar 2020
Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder
Zhisheng Xiao
Qing Yan
Y. Amit
OODD
195
195
0
06 Mar 2020
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data
Kazi Nazmul Haque
R. Rana
John H. L. Hansen
Björn Schuller
GAN
67
3
0
05 Mar 2020
AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment
Zhen Zeng
Jianzong Wang
Ning Cheng
Tian Xia
Jing Xiao
VLM
75
56
0
04 Mar 2020
GraphTTS: graph-to-sequence modelling in neural text-to-speech
Aolan Sun
Jianzong Wang
Ning Cheng
Huayi Peng
Zhen Zeng
Jing Xiao
52
21
0
04 Mar 2020
SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks
Karim Guirguis
Christoph Schorn
A. Guntoro
Sherif Abdulatif
Bin Yang
47
57
0
03 Mar 2020
Semi-supervised learning of glottal pulse positions in a neural analysis-synthesis framework
F. Bous
Luc Ardaillon
Axel Roebel
23
1
0
02 Mar 2020
Learn2Perturb: an End-to-end Feature Perturbation Learning to Improve Adversarial Robustness
Ahmadreza Jeddi
M. Shafiee
Michelle Karg
C. Scharfenberger
A. Wong
OOD, AAML
129
67
0
02 Mar 2020
GANs with Conditional Independence Graphs: On Subadditivity of Probability Divergences
Mucong Ding
C. Daskalakis
Soheil Feizi
GAN
43
2
0
02 Mar 2020
Introduction to deep learning
Lihi Shiloh-Perl
Raja Giryes
67
0
0
29 Feb 2020
Temporal Convolutional Attention-based Network For Sequence Modeling
Hongyan Hao
Yan Wang
Siqiao Xue
Yudi Xia
Jian Zhao
S. Furao
75
41
0
28 Feb 2020
Time Series Data Augmentation for Deep Learning: A Survey
Qingsong Wen
Liang Sun
Fan Yang
Xiaomin Song
Jing Gao
Xue Wang
Huan Xu
AI4TS
156
649
0
27 Feb 2020
Woodbury Transformations for Deep Generative Flows
You Lu
Bert Huang
85
16
0
27 Feb 2020
Unsupervised Discovery, Control, and Disentanglement of Semantic Attributes with Applications to Anomaly Detection
William Paul
I-J. Wang
F. Alajaji
Philippe Burlina
DiffM, DRL
58
6
0
25 Feb 2020
Informative Bayesian Neural Network Priors for Weak Signals
Tianyu Cui
A. Havulinna
Pekka Marttinen
Samuel Kaski
65
9
0
24 Feb 2020
Breaking Batch Normalization for better explainability of Deep Neural Networks through Layer-wise Relevance Propagation
M. Guillemot
C. Heusele
R. Korichi
S. Schnebert
Liming Chen
FAtt
57
18
0
24 Feb 2020