ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio
arXiv:1609.03499, v2 (latest)

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding
S. Tonekaboni
Danny Eytan
Anna Goldenberg
CML, SSL, AI4TS
171
298
0
01 Jun 2021
Enhancing Trajectory Prediction using Sparse Outputs: Application to Team Sports
Brandon Victor
Aiden Nibali
Zhen He
D. Carey
39
9
0
01 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
89
15
0
31 May 2021
StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts
Matthew Baas
Herman Kamper
53
6
0
31 May 2021
Multi-Scale Temporal Convolution Network for Classroom Voice Detection
Lu Ma
Xintian Wang
Song Yang
Y. Gong
Zhongqin Wu
47
1
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
279
1,246
0
30 May 2021
DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
Songxiang Liu
Yuewen Cao
Dan Su
Helen Meng
DiffM
86
59
0
28 May 2021
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding
Neil Zeghidour
O. Teboul
David Grangier
63
13
0
28 May 2021
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles
Zhaoxuan Zhu
Nicola Pivaro
Shobhit Gupta
Abhishek Gupta
Marcello Canova
OffRL
65
37
0
25 May 2021
Inclusion of Domain-Knowledge into GNNs using Mode-Directed Inverse Entailment
T. Dash
A. Srinivasan
A. Baskar
72
13
0
22 May 2021
Spatial-temporal Conv-sequence Learning with Accident Encoding for Traffic Flow Prediction
Zichuan Liu
Rui Zhang
Chen Wang
Zhu Xiao
Hongbo Jiang
AI4TS
49
20
0
21 May 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters
Pritish Chandna
António Ramires
Xavier Serra
Emilia Gómez
61
4
0
21 May 2021
Temporal convolutional networks predict dynamic oxygen uptake response from wearable sensors across exercise intensities
Robert Amelard
E. Hedge
R. Hughson
26
18
0
20 May 2021
High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling
Patrick Lumban Tobing
Tomoki Toda
67
8
0
20 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection
S. Afanasiev
A. Smirnova
D. Kotereva
64
2
0
17 May 2021
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation
Shoule Wu
Ziqiang Shi
DiffM
157
11
0
17 May 2021
Drill the Cork of Information Bottleneck by Inputting the Most Important Data
Xinyu Peng
Jiawei Zhang
Feiyue Wang
Li Li
43
6
0
15 May 2021
Predicting speech intelligibility from EEG in a non-linear classification paradigm
Bernd Accou
Mohammad Jalilpour-Monesi
Hugo Van hamme
T. Francart
22
12
0
14 May 2021
Advances in Machine and Deep Learning for Modeling and Real-time Detection of Multi-Messenger Sources
Eliu A. Huerta
Zhizhen Zhao
106
21
0
13 May 2021
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Vadim Popov
Ivan Vovk
Vladimir Gogoryan
Tasnima Sadekova
Mikhail Kudinov
DiffM
119
544
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
496
8,017
0
11 May 2021
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents
Chaojun Xiao
Xueyu Hu
Zhiyuan Liu
Cunchao Tu
Maosong Sun
AILaw, ELM
104
245
0
09 May 2021
Machine Learning (ML)-Centric Resource Management in Cloud Computing: A Review and Future Directions
Tahseen Khan
Wenhong Tian
Rajkumar Buyya
58
106
0
09 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Liqiang He
Shulin Feng
Dan Su
Dong Yu
54
0
0
08 May 2021
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu
Chengxi Li
Yi Ren
Feiyang Chen
Zhou Zhao
DiffM
198
271
0
06 May 2021
Non-Autoregressive vs Autoregressive Neural Networks for System Identification
Daniel Weber
C. Gühmann
58
7
0
05 May 2021
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Qijing Huang
Minwoo Kang
Grace Dinh
Thomas Norell
Aravind Kalaiah
J. Demmel
J. Wawrzynek
Y. Shao
72
112
0
05 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
J. Nistal
Cyran Aouameur
Stefan Lattner
G. Richard
105
7
0
04 May 2021
OCTOPUS: Overcoming Performance and Privatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
69
8
0
03 May 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
91
49
0
28 Apr 2021
Learning deep autoregressive models for hierarchical data
Carl R. Andersson
Niklas Wahlström
Thomas B. Schon
BDL
57
3
0
28 Apr 2021
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
90
22
0
27 Apr 2021
Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Franco Pellegrini
Giulio Biroli
111
6
0
27 Apr 2021
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Rodrigo Mira
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Björn W. Schuller
Maja Pantic
121
47
0
27 Apr 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
120
6
0
26 Apr 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Erica Cooper
Xin Wang
Junichi Yamagishi
91
6
0
25 Apr 2021
Restoring degraded speech via a modified diffusion model
Jianwei Zhang
Suren Jayasuriya
Visar Berisha
DiffM
64
21
0
22 Apr 2021
Scaling of neural-network quantum states for time evolution
Sheng-Hsuan Lin
F. Pollmann
66
25
0
21 Apr 2021
Lossless Compression with Latent Variable Models
James Townsend
BDL, DRL
78
6
0
21 Apr 2021
Eye Know You: Metric Learning for End-to-end Biometric Authentication Using Eye Movements from a Longitudinal Dataset
Dillon Lohr
Henry K. Griffith
Oleg V. Komogortsev
89
33
0
21 Apr 2021
Superpixels and Graph Convolutional Neural Networks for Efficient Detection of Nutrient Deficiency Stress from Aerial Imagery
Saba Dadsetan
David Pichler
David Wilson
N. Hovakimyan
Jennifer Hobbs
99
6
0
20 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT, VGen
345
513
0
20 Apr 2021
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM, ALM
94
25
0
20 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
Šimon Mandlík
Tomás Pevný
52
5
0
19 Apr 2021
Recursive input and state estimation: A general framework for learning from time series with missing data
Alberto García-Durán
Robert West
AI4TS
30
2
0
17 Apr 2021
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Saida Mussakhojayeva
Aigerim Janaliyeva
A. Mirzakhmetov
Yerbolat Khassanov
H. A. Varol
61
14
0
17 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
97
202
0
16 Apr 2021
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning
Narendra Chaudhary
Sanchit Misra
Dhiraj D. Kalamkar
A. Heinecke
E. Georganas
Barukh Ziv
Menachem Adelman
Bharat Kaul
61
9
0
16 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
61
11
0
15 Apr 2021