WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
arXiv: 1609.03499

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Distilling the Knowledge from Conditional Normalizing Flows
Dmitry Baranchuk
Vladimir Aliev
Artem Babenko
BDL
85
2
0
24 Jun 2021
Speech is Silver, Silence is Golden: What do ASVspoof-trained Models Really Learn?
Nicolas Müller
Franziska Dieckmann
Pavel Czempin
Roman Canals
Konstantin Böttinger
Jennifer Williams
106
71
0
23 Jun 2021
Recurrent Neural Network from Adder's Perspective: Carry-lookahead RNN
Haowei Jiang
Fei-wei Qin
Jin Cao
Yong Peng
Yanli Shao
LRM ODL
68
43
0
22 Jun 2021
UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control
M. Kang
Sungjae Kim
Injung Kim
77
3
0
21 Jun 2021
Non-native English lexicon creation for bilingual speech synthesis
Arun Baby
Pranav Jawale
Saranya Vinnaitherthan
Sumukh Badam
Nagaraj Adiga
Sharath Adavanne
44
8
0
21 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Jian Cong
Shan Yang
Lei Xie
Jane Polak Scowcroft
DRL
112
29
0
21 Jun 2021
Low-rank Characteristic Tensor Density Estimation Part II: Compression and Latent Density Estimation
Magda Amiridi
Nikos Kargas
N. Sidiropoulos
121
11
0
20 Jun 2021
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
M. S. Al-Radhi
Tamás Gábor Csapó
Géza Németh
13
2
0
19 Jun 2021
Deep Generative Learning via Schrödinger Bridge
Gefei Wang
Yuling Jiao
Qiang Xu
Yang Wang
Can Yang
DiffM OT
102
103
0
19 Jun 2021
ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models
Tijin Yan
Hongwei Zhang
Tong Zhou
Yufeng Zhan
Yuanqing Xia
DiffM AI4TS
88
40
0
18 Jun 2021
Novelty Detection via Contrastive Learning with Negative Data Augmentation
Chengwei Chen
Yuan Xie
Shaohui Lin
Ruizhi Qiao
Jingren Zhou
Xin Tan
Yi Zhang
Lizhuang Ma
SSL
84
13
0
18 Jun 2021
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
DiffM
102
88
0
17 Jun 2021
SCINet: Time Series Modeling and Forecasting with Sample Convolution and Interaction
Minhao Liu
Ailing Zeng
Mu-Hwa Chen
Zhijian Xu
Qiuxia Lai
Lingna Ma
Qiang Xu
AI4TS
131
389
0
17 Jun 2021
Joining datasets via data augmentation in the label space for neural networks
Jake Zhao
Mingfeng Ou
Linji Xue
Yunkai Cui
Sai Wu
Gang Chen
36
2
0
17 Jun 2021
Input Invex Neural Network
Suman Sapkota
Binod Bhattarai
50
4
0
16 Jun 2021
Detecting message modification attacks on the CAN bus with Temporal Convolutional Networks
I. Chiscop
András Gazdag
Joost Bosman
G. Biczók
AAML
34
8
0
16 Jun 2021
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows
Adam Gabryś
Yunlong Jiao
V. Klimkov
Daniel Korzekwa
Roberto Barra-Chicote
50
1
0
16 Jun 2021
Global Rhythm Style Transfer Without Text Transcriptions
Kaizhi Qian
Yang Zhang
Shiyu Chang
Jinjun Xiong
Chuang Gan
David D. Cox
M. Hasegawa-Johnson
81
20
0
16 Jun 2021
WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution
Kexun Zhang
Yi Ren
Changliang Xu
Zhou Zhao
107
30
0
16 Jun 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Mateusz Malinowski
Dimitrios Vytiniotis
G. Swirszcz
Viorica Patraucean
João Carreira
65
8
0
15 Jun 2021
NeuroBEM: Hybrid Aerodynamic Quadrotor Model
L. Bauersfeld
Elia Kaufmann
Philipp Foehn
Sihao Sun
Davide Scaramuzza
141
93
0
15 Jun 2021
Learning to Compensate: A Deep Neural Network Framework for 5G Power Amplifier Compensation
Po-Yu Chen
Hao-Wei Chen
Yi-Min Tsai
Hsien-Kai Kuo
Hantao Huang
Hsin-Hung Chen
Sheng-Hong Yan
Wei-Lun Ou
Chia-Ming Cheng
38
3
0
15 Jun 2021
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Won Jang
D. Lim
Jaesam Yoon
Bongwan Kim
Juntae Kim
119
132
0
15 Jun 2021
WaveNet-Based Deep Neural Networks for the Characterization of Anomalous Diffusion (WADNet)
Dezhong Li
Qiujin Yao
Zihan Huang
DiffM
61
19
0
14 Jun 2021
Unsupervised Learning of Visual 3D Keypoints for Control
Boyuan Chen
Pieter Abbeel
Deepak Pathak
3DPC SSL
92
40
0
14 Jun 2021
Hierarchically Regularized Deep Forecasting
Biswajit Paria
Rajat Sen
Amr Ahmed
Abhimanyu Das
AI4TS
70
16
0
14 Jun 2021
Non Gaussian Denoising Diffusion Models
Eliya Nachmani
Robin San Roman
Lior Wolf
VLM DiffM
83
50
0
14 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard
Gaëtan Hadjeres
DiffM
47
43
0
14 Jun 2021
Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis
M. S. Al-Radhi
Tamás Gábor Csapó
Csaba Zainkó
Géza Németh
31
3
0
12 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM DiffM
140
121
0
12 Jun 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Gal Greshler
Tamar Rott Shaham
T. Michaeli
104
25
0
11 Jun 2021
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior
Sang-gil Lee
Heeseung Kim
Chaehun Shin
Xu Tan
Chang-Shu Liu
Qi Meng
Tao Qin
Wei Chen
Sung-Hoon Yoon
Tie-Yan Liu
DiffM
85
89
0
11 Jun 2021
HUI-Audio-Corpus-German: A high quality TTS dataset
Pascal Puchtler
Johannes Wirth
René Peinl
65
22
0
11 Jun 2021
Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache
René Peinl
50
0
0
11 Jun 2021
Monotonic Neural Network: combining Deep Learning with Domain Knowledge for Chiller Plants Energy Optimization
Fanhe Ma
Faen Zhang
Shenglan Ben
Shuxin Qin
Pengcheng Zhou
Changsheng Zhou
Fengyi Xu
60
0
0
11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
170
903
0
11 Jun 2021
Fair Normalizing Flows
Mislav Balunović
Anian Ruoss
Martin Vechev
AAML
66
38
0
10 Jun 2021
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Iván Vallés-Pérez
Julian Roth
Grzegorz Beringer
Roberto Barra-Chicote
J. Droppo
102
8
0
10 Jun 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
208
1,150
0
08 Jun 2021
Speech BERT Embedding For Improving Prosody in Neural TTS
Liping Chen
Yan Deng
Xi Wang
Frank Soong
Lei He
89
23
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
112
12
0
08 Jun 2021
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
90
14
0
08 Jun 2021
Tabular Data: Deep Learning is Not All You Need
Ravid Shwartz-Ziv
Amitai Armon
LMTD
205
1,304
0
06 Jun 2021
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Dong Min
Dong Bok Lee
Eunho Yang
Sung Ju Hwang
136
175
0
06 Jun 2021
The Image Local Autoregressive Transformer
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
82
13
0
04 Jun 2021
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Ji-Hoon Kim
Sang-Hoon Lee
Ji-Hyun Lee
Seong-Whan Lee
106
54
0
04 Jun 2021
An Improved Model for Voicing Silent Speech
David Gaddy
Dana Klein
72
34
0
03 Jun 2021
Drivers' Manoeuvre Modelling and Prediction for Safe HRI
Erwin Jose López Pulgarín
G. Herrmann
U. Leonards
31
0
0
03 Jun 2021
NVC-Net: End-to-End Adversarial Voice Conversion
Bac Nguyen Cong
Fabien Cardinaux
AAML
128
42
0
02 Jun 2021
Deep Personalized Glucose Level Forecasting Using Attention-based Recurrent Neural Networks
Mohammadreza Armandpour
Brian Kidd
Yu Du
Jianhua Z. Huang
81
15
0
02 Jun 2021