ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio
v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Action Segmentation with Mixed Temporal Domain Adaptation
Action Segmentation with Mixed Temporal Domain Adaptation
Min-Hung Chen
Baopu Li
Yingze Bao
Ghassan AlRegib
120
30
0
15 Apr 2021
RIANN -- A Robust Neural Network Outperforms Attitude Estimation Filters
RIANN -- A Robust Neural Network Outperforms Attitude Estimation Filters
Daniel Weber
C. Gühmann
Thomas Seel
69
37
0
15 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration
On the Design of Deep Priors for Unsupervised Audio Restoration
V. Narayanaswamy
Jayaraman J. Thiagarajan
A. Spanias
AI4CE
55
5
0
14 Apr 2021
ADNet: Temporal Anomaly Detection in Surveillance Videos
ADNet: Temporal Anomaly Detection in Surveillance Videos
H. Öztürk
Ahmet Burak Can
130
15
0
14 Apr 2021
Neural basis expansion analysis with exogenous variables: Forecasting
  electricity prices with NBEATSx
Neural basis expansion analysis with exogenous variables: Forecasting electricity prices with NBEATSx
Kin G. Olivares
Cristian Challu
Grzegorz Marcjasz
R. Weron
A. Dubrawski
AI4TS
99
151
0
12 Apr 2021
Boltzmann Tuning of Generative Models
Boltzmann Tuning of Generative Models
Victor Berger
Michele Sebag
57
0
0
12 Apr 2021
Protein sequence design with deep generative models
Protein sequence design with deep generative models
Zachary Wu
Kadina E. Johnston
F. Arnold
Kevin Kaichuang Yang
92
142
0
09 Apr 2021
The AS-NU System for the M2VoC Challenge
The AS-NU System for the M2VoC Challenge
Cheng-Hung Hu
Yi-Chiao Wu
Wen-Chin Huang
Yu-Huai Peng
Yu-Wen Chen
Pin-Jui Ku
Tomoki Toda
Yu Tsao
Hsin-Min Wang
54
1
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
129
43
0
06 Apr 2021
Noise Estimation for Generative Diffusion Models
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
136
107
0
06 Apr 2021
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
Junhyeok Lee
Seungu Han
DiffM
83
70
0
06 Apr 2021
Training Deep Normalizing Flow Models in Highly Incomplete Data
  Scenarios with Prior Regularization
Training Deep Normalizing Flow Models in Highly Incomplete Data Scenarios with Prior Regularization
Edgar A. Bernal
43
1
0
03 Apr 2021
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Myeonghun Jeong
Hyeongju Kim
Sung Jun Cheon
Byoung Jin Choi
N. Kim
DiffM
72
197
0
03 Apr 2021
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation
  of Teacher Ensembles for Spoken Command Classification
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Chin-Hui Lee
134
22
0
02 Apr 2021
Learnable Dynamic Temporal Pooling for Time Series Classification
Learnable Dynamic Temporal Pooling for Time Series Classification
Dongha Lee
Seonghyeon Lee
Hwanjo Yu
AI4TS
92
29
0
02 Apr 2021
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech
  Synthesis Techniques
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Kang-Wook Kim
Seung-won Park
Junhyeok Lee
Myun-chul Joe
76
28
0
02 Apr 2021
Multi-rate attention architecture for fast streamable Text-to-speech
  spectrum modeling
Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Qing He
Zhiping Xiu
T. Koehler
Jilong Wu
75
7
0
01 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised
  Representations
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
147
318
0
01 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
66
4
0
01 Apr 2021
Collaborative Learning to Generate Audio-Video Jointly
Collaborative Learning to Generate Audio-Video Jointly
V. Kurmi
Vipul Bajaj
Badri N. Patro
K. Venkatesh
Vinay P. Namboodiri
Preethi Jyothi
VGen
60
11
0
01 Apr 2021
Adversarial Attacks and Defenses for Speech Recognition Systems
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr Żelasko
Sonal Joshi
Yiwen Shao
Jesus Villalba
J. Trmal
Najim Dehak
Sanjeev Khudanpur
AAML
63
29
0
31 Mar 2021
Wave based damage detection in solid structures using artificial neural
  networks
Wave based damage detection in solid structures using artificial neural networks
Frank Wuttke
Hao Lyu
A. Sattari
Z. Rizvi
27
1
0
30 Mar 2021
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless
  Compression
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression
Shifeng Zhang
Chen Zhang
Ning Kang
Zhenguo Li
75
38
0
30 Mar 2021
Symbolic Music Generation with Diffusion Models
Symbolic Music Generation with Diffusion Models
Gautam Mittal
Jesse Engel
Curtis Hawthorne
Ian Simon
MGenDiffM
113
194
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
PixelTransformer: Sample Conditioned Signal Generation
Shubham Tulsiani
Abhinav Gupta
76
17
0
29 Mar 2021
A Temporal Kernel Approach for Deep Learning with Continuous-time
  Information
A Temporal Kernel Approach for Deep Learning with Continuous-time Information
Da Xu
Chuanwei Ruan
Evren Körpeoglu
Sushant Kumar
Kannan Achan
SyDaAI4TS
54
5
0
28 Mar 2021
Improved Autoregressive Modeling with Distribution Smoothing
Improved Autoregressive Modeling with Distribution Smoothing
Chenlin Meng
Jiaming Song
Yang Song
Shengjia Zhao
Stefano Ermon
DiffM
76
23
0
28 Mar 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye Jia
Heiga Zen
Jonathan Shen
Yu Zhang
Yonghui Wu
SSL
103
84
0
28 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
95
14
0
27 Mar 2021
Online structural health monitoring by model order reduction and deep
  learning algorithms
Online structural health monitoring by model order reduction and deep learning algorithms
Luca Rosafalco
Matteo Torzoni
Andrea Manzoni
S. Mariani
A. Corigliano
OffRL
34
49
0
26 Mar 2021
Improve GAN-based Neural Vocoder using Pointwise Relativistic
  LeastSquare GAN
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
Cong Wang
Yu Chen
Bin Wang
Yi Shi
146
1
0
26 Mar 2021
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Nikhil Singh
Jeff Mentch
Jerry Ng
Matthew Beveridge
Iddo Drori
65
47
0
26 Mar 2021
Under Pressure: Learning to Detect Slip with Barometric Tactile Sensors
Under Pressure: Learning to Detect Slip with Barometric Tactile Sensors
Abhinav Grover
C. Grebe
Philippe Nadeau
Jonathan Kelly
51
6
0
24 Mar 2021
Learned complex masks for multi-instrument source separation
Learned complex masks for multi-instrument source separation
Andreas Jansson
Rachel M. Bittner
N. Montecchio
Tillman Weyde
17
0
0
23 Mar 2021
CLIP: Cheap Lipschitz Training of Neural Networks
CLIP: Cheap Lipschitz Training of Neural Networks
Leon Bungert
René Raab
Tim Roith
Leo Schwinn
Daniel Tenbrinck
59
33
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
81
16
0
22 Mar 2021
An Experimental Review on Deep Learning Architectures for Time Series
  Forecasting
An Experimental Review on Deep Learning Architectures for Time Series Forecasting
Pedro Lara-Benítez
Manuel Carranza-García
José Cristóbal Riquelme Santos
AI4TS
109
318
0
22 Mar 2021
Learning physical properties of anomalous random walks using graph
  neural networks
Learning physical properties of anomalous random walks using graph neural networks
Hippolyte Verdier
M. Duval
François Laurent
Alhassan Cassé
Christian L. Vestergaard
Jean-Baptiste Masson
71
25
0
22 Mar 2021
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German
Pelin Dogan-Schönberger
Julian Mäder
Thomas Hofmann
65
30
0
21 Mar 2021
Graph Attention Recurrent Neural Networks for Correlated Time Series
  Forecasting -- Full version
Graph Attention Recurrent Neural Networks for Correlated Time Series Forecasting -- Full version
Razvan-Gabriel Cirstea
Chenjuan Guo
B. Yang
AI4TS
100
43
0
19 Mar 2021
GPNAS: A Neural Network Architecture Search Framework Based on Graphical
  Predictor
GPNAS: A Neural Network Architecture Search Framework Based on Graphical Predictor
Dige Ai
Hong Zhang
AI4CE
53
0
0
19 Mar 2021
Improving Zero-shot Voice Style Transfer via Disentangled Representation
  Learning
Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning
Siyang Yuan
Pengyu Cheng
Ruiyi Zhang
Weituo Hao
Zhe Gan
Lawrence Carin
DRL
74
61
0
17 Mar 2021
Lightweight and interpretable neural modeling of an audio distortion
  effect using hyperconditioned differentiable biquads
Lightweight and interpretable neural modeling of an audio distortion effect using hyperconditioned differentiable biquads
S. Nercessian
Andy M. Sarroff
K. Werner
40
29
0
15 Mar 2021
Signal Representations for Synthesizing Audio Textures with Generative
  Adversarial Networks
Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks
Chitralekha Gupta
Purnima Kamath
L. Wyse
60
9
0
12 Mar 2021
Latent Space Explorations of Singing Voice Synthesis using DDSP
Latent Space Explorations of Singing Voice Synthesis using DDSP
J. Alonso
Cumhur Erkut
145
12
0
12 Mar 2021
Tensor networks and efficient descriptions of classical data
Tensor networks and efficient descriptions of classical data
Sirui Lu
Márton Kanász-Nagy
I. Kukuljan
J. I. Cirac
54
26
0
11 Mar 2021
Variable-rate discrete representation learning
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDLDRL
82
24
0
10 Mar 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
J. You
Dalhyun Kim
Gyuhyeon Nam
Geumbyeol Hwang
Gyeongsu Chae
68
27
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs,
  Normalizing Flows, Energy-Based and Autoregressive Models
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLMTPM
193
511
0
08 Mar 2021
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Daxin Tan
Hingpang Huang
Guangyan Zhang
Tan Lee
65
6
0
08 Mar 2021
Previous
123...313233...606162
Next