ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM, DiffM
355
7,541
0
06 Oct 2020
JSSS: free Japanese speech corpus for summarization and simplification
Shinnosuke Takamichi
Mamoru Komachi
Naoko Tanji
Hiroshi Saruwatari
32
1
0
05 Oct 2020
D3Net: Densely connected multidilated DenseNet for music source separation
Naoya Takahashi
Yuki Mitsufuji
MedIm
183
70
0
05 Oct 2020
VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models
Zhisheng Xiao
Karsten Kreis
Jan Kautz
Arash Vahdat
120
124
0
01 Oct 2020
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson Eisenach
Yagna Patel
Dhruv Madeka
AI4TS
107
37
0
30 Sep 2020
Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text
Shaoxiong Ji
Min Zhang
Pekka Marttinen
MedIm
68
35
0
30 Sep 2020
Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data
Mingyang Zhang
Yi Zhou
Li Zhao
Haizhou Li
92
53
0
30 Sep 2020
Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network
Xing Wang
Yijun Wang
Bin Weng
Aleksandr Vinel
AIFin, AI4TS
77
11
0
29 Sep 2020
Lip-reading with Densely Connected Temporal Convolutional Networks
Pingchuan Ma
Yujiang Wang
Jie Shen
Stavros Petridis
Maja Pantic
83
58
0
29 Sep 2020
Variational Temporal Deep Generative Model for Radar HRRP Target Recognition
D. Guo
Bo Chen
Wenchao Chen
Changbo Wang
Hongwei Liu
Mingyuan Zhou
BDL
88
36
0
28 Sep 2020
Recognition and Synthesis of Object Transport Motion
Connor Daly
GAN
16
0
0
27 Sep 2020
Iterative Reconstruction for Low-Dose CT using Deep Gradient Priors of Generative Model
Zhuonan He
Yikun Zhang
Yu Guan
S. Niu
Yi Zhang
Yang Chen
Qiegen Liu
DiffM, MedIm
93
12
0
27 Sep 2020
N-BEATS neural network for mid-term electricity load forecasting
Boris N. Oreshkin
Grzegorz Dudek
Paweł Pełka
Ekaterina Turkina
AI4TS
50
84
0
24 Sep 2020
Haar Wavelet based Block Autoregressive Flows for Trajectories
Apratim Bhattacharyya
C. Straehle
Mario Fritz
Bernt Schiele
AI4TS
70
15
0
21 Sep 2020
FakeTagger: Robust Safeguards against DeepFake Dissemination via Provenance Tracking
Run Wang
Felix Juefei Xu
Mengqing Luo
Yang Liu
Lina Wang
116
76
0
21 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM, BDL
253
1,472
0
21 Sep 2020
Shimon the Rapper: A Real-Time System for Human-Robot Interactive Rap Battles
Richard J. Savery
Lisa Zahray
Gil Weinberg
54
17
0
19 Sep 2020
GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning
Chang-rui Liu
Huichu Zhang
Weinan Zhang
Guanjie Zheng
Yong Yu
48
45
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
Robert Susik
AI4TS
18
6
0
15 Sep 2020
Controllable neural text-to-speech synthesis using intuitive prosodic features
T. Raitio
Ramya Rasipuram
D. Castellani
78
66
0
14 Sep 2020
Adaptive Convolution Kernel for Artificial Neural Networks
F. B. Tek
Ilker Çam
D. Karli
33
14
0
14 Sep 2020
Visual-speech Synthesis of Exaggerated Corrective Feedback
Yaohua Bu
Weijun Li
Tianyi Ma
S. Chen
Jia Jia
Kun Li
Xiaobo Lu
45
1
0
12 Sep 2020
Activation Relaxation: A Local Dynamical Approximation to Backpropagation in the Brain
Beren Millidge
Alexander Tschantz
A. Seth
Christopher L. Buckley
ODL
81
17
0
11 Sep 2020
Exploration of End-to-end Synthesisers for Zero Resource Speech Challenge 2020
Karthik Pandia D.S.
Anusha Prakash
M. M.
H. Murthy
42
4
0
10 Sep 2020
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Kamal Gupta
Susmija Jabbireddy
Ketul Shah
Abhinav Shrivastava
Matthias Zwicker
3DV
43
5
0
07 Sep 2020
Proximity Sensing: Modeling and Understanding Noisy RSSI-BLE Signals and Other Mobile Sensor Data for Digital Contact Tracing
Sheshank Shankar
Rishank Kanaparti
Ayush Chopra
Rohan Sukumaran
Parth Patwa
Myungsun Kang
Abhishek Singh
Kevin P. McPherson
Ramesh Raskar
64
3
0
04 Sep 2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei Chen
Xu Tan
Jian Luan
Tao Qin
Tie-Yan Liu
VLM
104
93
0
03 Sep 2020
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer
Jing-Xuan Zhang
Li-Juan Liu
Yan-Nian Chen
Ya-Jun Hu
Yuan Jiang
Zhenhua Ling
Lirong Dai
57
17
0
03 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM, BDL
164
795
0
02 Sep 2020
LAVARNET: Neural Network Modeling of Causal Variable Relationships for Multivariate Time Series Forecasting
C. Koutlis
Symeon Papadopoulos
Emmanouil Schinas
Y. Kompatsiaris
CML, AI4TS
33
16
0
02 Sep 2020
Hierarchical Timbre-Painting and Articulation Generation
Michael Michelashvili
Lior Wolf
86
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
150
223
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
88
211
0
28 Aug 2020
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks
J. Nistal
Stefan Lattner
G. Richard
GAN
86
56
0
27 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
99
20
0
27 Aug 2020
DeepVOX: Discovering Features from Raw Audio for Speaker Recognition in Non-ideal Audio Signals
Anurag Chowdhury
Arun Ross
34
2
0
26 Aug 2020
Generating Handwriting via Decoupled Style Descriptors
Atsunobu Kotani
Stefanie Tellex
James Tompkin
72
25
0
26 Aug 2020
ANGUS: Real-time manipulation of vocal roughness for emotional speech transformations
M. Liuni
Luc Ardaillon
L. Bonal
Lou Seropian
J. Aucouturier
8
4
0
25 Aug 2020
Medley2K: A Dataset of Medley Transitions
Lukas Faber
Sandro Luck
Damian Pascual
Andreas Roth
Gino Brunner
Roger Wattenhofer
MedIm
53
0
0
25 Aug 2020
Using Deep Networks for Scientific Discovery in Physiological Signals
Tom Beer
Bar Eini-Porat
Sebastian Goodfellow
Danny Eytan
Uri Shalit
59
5
0
25 Aug 2020
Dynamic Future Net: Diversified Human Motion Generation
Wenheng Chen
He Wang
Yi Yuan
Tianjia Shao
Kun Zhou
3DH
98
23
0
25 Aug 2020
ATM Cash demand forecasting in an Indian Bank with chaos and deep learning
Sarveswararao Vangala
V. Ravi
BDL
55
21
0
24 Aug 2020
RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns
A. Mondal
Prathosh A.P.
VGen
20
3
0
21 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
121
1
0
20 Aug 2020
Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning
Noé Tits
Kevin El Haddad
Thierry Dutoit
31
14
0
20 Aug 2020
Learning to Generate Diverse Dance Motions with Transformer
Jiaman Li
Yihang Yin
Hang Chu
Yi Zhou
Tingwu Wang
Sanja Fidler
Hao Li
86
125
0
18 Aug 2020
Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks
Michal Romaniuk
Piotr Masztalski
K. Piaskowski
M. Matuszewski
25
5
0
17 Aug 2020
POP909: A Pop-song Dataset for Music Arrangement Generation
Ziyu Wang
Kai Chen
Junyan Jiang
Yiyi Zhang
Maoran Xu
Shuqi Dai
Xianbin Gu
Gus Xia
76
139
0
17 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSL, DRL
54
15
0
16 Aug 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
111
11
0
16 Aug 2020