ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,046 papers shown
Title
Don't Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence
Tianshi Cao
Alex Bie
Arash Vahdat
Sanja Fidler
Karsten Kreis
SyDa
DiffM
39
71
0
01 Nov 2021
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Shengyuan Xu
Wenxiao Zhao
Jing Guo
30
12
0
01 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
118
1,709
0
31 Oct 2021
QDCNN: Quantum Dilated Convolutional Neural Network
Yixiong Chen
32
4
0
29 Oct 2021
Ask "Who", Not "What": Bitcoin Volatility Forecasting with Twitter Data
M. E. Akbiyik
Mert Erkul
Killian Kaempf
V. Vasiliauskaite
Nino Antulov-Fantulin
OOD
25
9
0
27 Oct 2021
Assessing Evaluation Metrics for Speech-to-Speech Translation
Elizabeth Salesky
Julian Mäder
Severin Klinger
42
14
0
26 Oct 2021
Probabilistic Hierarchical Forecasting with Deep Poisson Mixtures
Kin G. Olivares
N. Meetei
Ruijun Ma
Rohan Reddy
Mengfei Cao
Lee Dicker
AI4TS
39
24
0
25 Oct 2021
Neural Flows: Efficient Alternative to Neural ODEs
Marin Biloš
Johanna Sommer
Syama Sundar Rangapuram
Tim Januschowski
Stephan Günnemann
AI4TS
43
71
0
25 Oct 2021
Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience
Weiyi Lu
Meng-Hsuan Wu
Yuh-ming Chiu
Li Su
32
0
0
25 Oct 2021
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021
Yanqing Liu
Rui Shao
G. Wang
Kuan Chen
Bohan Li
Pong C. Yuen
Jinzhu Li
Lei He
Sheng Zhao
44
55
0
25 Oct 2021
ProtoShotXAI: Using Prototypical Few-Shot Architecture for Explainable AI
Samuel Hess
G. Ditzler
AAML
38
1
0
22 Oct 2021
Merging Two Cultures: Deep and Statistical Learning
A. Bhadra
J. Datta
Nicholas G. Polson
Vadim Sokolov
Jianeng Xu
BDL
50
9
0
22 Oct 2021
A Data-Centric Optimization Framework for Machine Learning
Oliver Rausch
Tal Ben-Nun
Nikoli Dryden
Andrei Ivanov
Shigang Li
Torsten Hoefler
AI4CE
27
16
0
20 Oct 2021
What Averages Do Not Tell -- Predicting Real Life Processes with Sequential Deep Learning
István Ketykó
F. Mannhardt
Marwan Hassani
B. V. Dongen
AI4TS
16
10
0
19 Oct 2021
The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding
Pratik Fegade
Tianqi Chen
Phillip B. Gibbons
T. Mowry
35
29
0
19 Oct 2021
Chunked Autoregressive GAN for Conditional Waveform Synthesis
Max Morrison
Rithesh Kumar
Kundan Kumar
Prem Seetharaman
Aaron Courville
Yoshua Bengio
GAN
57
69
0
19 Oct 2021
CycleFlow: Purify Information Factors by Cycle Loss
Haoran Sun
Chen Chen
Lantian Li
Dong Wang
32
1
0
18 Oct 2021
KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke
Xiaobin Zhuang
Huiran Yu
Weifeng Zhao
Tao Jiang
Peng Hu
37
6
0
18 Oct 2021
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis
Yongmao Zhang
Jian Cong
Heyang Xue
Lei Xie
Pengcheng Zhu
Mengxiao Bi
27
75
0
17 Oct 2021
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
43
124
0
17 Oct 2021
Neural Dubber: Dubbing for Videos According to Scripts
Chenxu Hu
Qiao Tian
Tingle Li
Yuping Wang
Yuxuan Wang
Hang Zhao
DiffM
VGen
41
40
0
15 Oct 2021
Advances and Challenges in Deep Lip Reading
Marzieh Oghbaie
Arian Sabaghi
Kooshan Hashemifard
Mohammad Akbari
VLM
35
15
0
15 Oct 2021
Diffusion Normalizing Flow
Qinsheng Zhang
Yongxin Chen
DiffM
40
88
0
14 Oct 2021
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang
Chenye Cui
Feiyang Chen
Yi Ren
Jinglin Liu
Zhou Zhao
Baoxing Huai
N. Yuan
GAN
111
62
0
14 Oct 2021
SpecSinGAN: Sound Effect Variation Synthesis Using Single-Image GANs
Adrián Barahona-Ríos
Tom Collins
GAN
27
4
0
14 Oct 2021
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Haitong Zhang
Yue Lin
31
0
0
14 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Andreas Triantafyllopoulos
U. Reichel
Shuo Liu
Simon Huber
F. Eyben
Björn W. Schuller
39
10
0
13 Oct 2021
A Melody-Unsupervision Model for Singing Voice Synthesis
Soonbeom Choi
Juhan Nam
29
14
0
13 Oct 2021
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Sergey Nikonorov
Berrak Sisman
Mingyang Zhang
Haizhou Li
23
2
0
13 Oct 2021
A Multi-scale Time-series Dataset with Benchmark for Machine Learning in Decarbonized Energy Grids
Xiangtian Zheng
Nan Xu
Loc Trinh
Dongqi Wu
Tong Huang
S. Sivaranjani
Yan Liu
Le Xie
AI4CE
46
44
0
12 Oct 2021
Adapting TTS models For New Speakers using Transfer Learning
Paarth Neekhara
Jason Chun Lok Li
Boris Ginsburg
60
15
0
12 Oct 2021
Unsupervised Source Separation via Bayesian Inference in the Latent Domain
Michele Mancusi
Emilian Postolache
Giorgio Mariani
Marco Fumero
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL
32
2
0
11 Oct 2021
Pitch Preservation In Singing Voice Synthesis
Shujun Liu
Hai Zhu
Kun Wang
Huajun Wang
28
0
0
11 Oct 2021
Application of Graph Convolutions in a Lightweight Model for Skeletal Human Motion Forecasting
L. Hermes
Barbara Hammer
M. Schilling
3DH
29
4
0
10 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
26
12
0
10 Oct 2021
Denoising Diffusion Gamma Models
Eliya Nachmani
S. Robin
Lior Wolf
DiffM
VLM
26
31
0
10 Oct 2021
F-Divergences and Cost Function Locality in Generative Modelling with Quantum Circuits
Chiara Leadbeater
Louis Sharrock
Brian Coyle
Marcello Benedetti
40
11
0
08 Oct 2021
Temporal Convolutions for Multi-Step Quadrotor Motion Prediction
Sam Looper
Steven L. Waslander
58
5
0
08 Oct 2021
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Pengfei Wu
Junjie Pan
Chenchang Xu
Junhui Zhang
Lin Wu
Xiang Yin
Zejun Ma
26
16
0
08 Oct 2021
MilliTRACE-IR: Contact Tracing and Temperature Screening via mm-Wave and Infrared Sensing
Marco Canil
Jacopo Pegoraro
Michele Rossi
37
12
0
08 Oct 2021
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV
ViT
68
149
0
07 Oct 2021
Cloning one's voice using very limited data in the wild
Dongyang Dai
Yuan-Jui Chen
Li Chen
Ming Tu
Lu Liu
Rui Xia
Qiao Tian
Yuping Wang
Yuxuan Wang
SyDa
33
10
0
07 Oct 2021
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over
Junchen Lu
Berrak Sisman
Rui Liu
Mingyang Zhang
Haizhou Li
DiffM
47
19
0
07 Oct 2021
Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS
T. Raitio
Jiangchuan Li
Shreyas Seshadri
54
22
0
06 Oct 2021
GANtron: Emotional Speech Synthesis with Generative Adversarial Networks
E. Hortal
Rodrigo Brechard Alarcia
GAN
31
2
0
06 Oct 2021
3D-MOV: Audio-Visual LSTM Autoencoder for 3D Reconstruction of Multiple Objects from Video
Justin Wilson
Ming-Chia Lin
27
1
0
05 Oct 2021
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
69
22
0
05 Oct 2021
Networked Time Series Prediction with Incomplete Data via Generative Adversarial Network
Yichen Zhu
Bo Jiang
Haiming Jin
Mengtian Zhang
Feng Gao
Jianqiang Huang
Tao Lin
Xinbing Wang
GNN
AI4TS
45
5
0
05 Oct 2021
Autoregressive Diffusion Models
Emiel Hoogeboom
Alexey A. Gritsenko
Jasmijn Bastings
Ben Poole
Rianne van den Berg
Tim Salimans
DiffM
50
149
0
05 Oct 2021
WaveBeat: End-to-end beat and downbeat tracking in the time domain
C. Steinmetz
Joshua D. Reiss
14
9
0
04 Oct 2021