Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Distilling the Knowledge from Conditional Normalizing Flows
Dmitry Baranchuk
Vladimir Aliev
Artem Babenko
BDL
85
2
0
24 Jun 2021
Speech is Silver, Silence is Golden: What do ASVspoof-trained Models Really Learn?
Nicolas Müller
Franziska Dieckmann
Pavel Czempin
Roman Canals
Konstantin Böttinger
Jennifer Williams
106
71
0
23 Jun 2021
Recurrent Neural Network from Adder's Perspective: Carry-lookahead RNN
Haowei Jiang
Fei-wei Qin
Jin Cao
Yong Peng
Yanli Shao
LRM
ODL
68
43
0
22 Jun 2021
UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control
M. Kang
Sungjae Kim
Injung Kim
77
3
0
21 Jun 2021
Non-native English lexicon creation for bilingual speech synthesis
Arun Baby
Pranav Jawale
Saranya Vinnaitherthan
Sumukh Badam
Nagaraj Adiga
Sharath Adavanne
44
8
0
21 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Jian Cong
Shan Yang
Lei Xie
Jane Polak Scowcroft
DRL
112
29
0
21 Jun 2021
Low-rank Characteristic Tensor Density Estimation Part II: Compression and Latent Density Estimation
Magda Amiridi
Nikos Kargas
N. Sidiropoulos
121
11
0
20 Jun 2021
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
M. S. Al-Radhi
Tamás Gábor Csapó
Géza Németh
13
2
0
19 Jun 2021
Deep Generative Learning via Schrödinger Bridge
Gefei Wang
Yuling Jiao
Qiang Xu
Yang Wang
Can Yang
DiffM
OT
102
103
0
19 Jun 2021
ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models
Tijin Yan
Hongwei Zhang
Tong Zhou
Yufeng Zhan
Yuanqing Xia
DiffM
AI4TS
88
40
0
18 Jun 2021
Novelty Detection via Contrastive Learning with Negative Data Augmentation
Chengwei Chen
Yuan Xie
Shaohui Lin
Ruizhi Qiao
Jingren Zhou
Xin Tan
Yi Zhang
Lizhuang Ma
SSL
84
13
0
18 Jun 2021
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
DiffM
102
88
0
17 Jun 2021
SCINet: Time Series Modeling and Forecasting with Sample Convolution and Interaction
Minhao Liu
Ailing Zeng
Mu-Hwa Chen
Zhijian Xu
Qiuxia Lai
Lingna Ma
Qiang Xu
AI4TS
131
389
0
17 Jun 2021
Joining datasets via data augmentation in the label space for neural networks
Jake Zhao
Mingfeng Ou
Linji Xue
Yunkai Cui
Sai Wu
Gang Chen
36
2
0
17 Jun 2021
Input Invex Neural Network
Suman Sapkota
Binod Bhattarai
50
4
0
16 Jun 2021
Detecting message modification attacks on the CAN bus with Temporal Convolutional Networks
I. Chiscop
András Gazdag
Joost Bosman
G. Biczók
AAML
34
8
0
16 Jun 2021
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows
Adam Gabry's
Yunlong Jiao
V. Klimkov
Daniel Korzekwa
Roberto Barra-Chicote
50
1
0
16 Jun 2021
Global Rhythm Style Transfer Without Text Transcriptions
Kaizhi Qian
Yang Zhang
Shiyu Chang
Jinjun Xiong
Chuang Gan
David D. Cox
M. Hasegawa-Johnson
81
20
0
16 Jun 2021
WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution
Kexun Zhang
Yi Ren
Changliang Xu
Zhou Zhao
107
30
0
16 Jun 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Mateusz Malinowski
Dimitrios Vytiniotis
G. Swirszcz
Viorica Patraucean
João Carreira
65
8
0
15 Jun 2021
NeuroBEM: Hybrid Aerodynamic Quadrotor Model
L. Bauersfeld
Elia Kaufmann
Philipp Foehn
Sihao Sun
Davide Scaramuzza
141
93
0
15 Jun 2021
Learning to Compensate: A Deep Neural Network Framework for 5G Power Amplifier Compensation
Po-Yu Chen
Hao-Wei Chen
Yi-Min Tsai
Hsien-Kai Kuo
Hantao Huang
Hsin-Hung Chen
Sheng-Hong Yan
Wei-Lun Ou
Chia-Ming Cheng
38
3
0
15 Jun 2021
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Won Jang
D. Lim
Jaesam Yoon
Bongwan Kim
Juntae Kim
119
132
0
15 Jun 2021
WaveNet-Based Deep Neural Networks for the Characterization of Anomalous Diffusion (WADNet)
Dezhong Li
Qiujin Yao
Zihan Huang
DiffM
61
19
0
14 Jun 2021
Unsupervised Learning of Visual 3D Keypoints for Control
Boyuan Chen
Pieter Abbeel
Deepak Pathak
3DPC
SSL
92
40
0
14 Jun 2021
Hierarchically Regularized Deep Forecasting
Biswajit Paria
Rajat Sen
Amr Ahmed
Abhimanyu Das
AI4TS
70
16
0
14 Jun 2021
Non Gaussian Denoising Diffusion Models
Eliya Nachmani
Robin San Roman
Lior Wolf
VLM
DiffM
83
50
0
14 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard
Gaëtan Hadjeres
DiffM
47
43
0
14 Jun 2021
Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis
M. S. Al-Radhi
Tamás Gábor Csapó
Csaba Zainkó
Géza Németh
31
3
0
12 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
140
121
0
12 Jun 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Gal Greshler
Tamar Rott Shaham
T. Michaeli
104
25
0
11 Jun 2021
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior
Sang-gil Lee
Heeseung Kim
Chaehun Shin
Xu Tan
Chang-Shu Liu
Qi Meng
Tao Qin
Wei Chen
Sung-Hoon Yoon
Tie-Yan Liu
DiffM
85
89
0
11 Jun 2021
HUI-Audio-Corpus-German: A high quality TTS dataset
Pascal Puchtler
Johannes Wirth
René Peinl
65
22
0
11 Jun 2021
Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache
René Peinl
50
0
0
11 Jun 2021
Monotonic Neural Network: combining Deep Learning with Domain Knowledge for Chiller Plants Energy Optimization
Fanhe Ma
Faen Zhang
Shenglan Ben
Shuxin Qin
Pengcheng Zhou
Changsheng Zhou
Fengyi Xu
60
0
0
11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
170
903
0
11 Jun 2021
Fair Normalizing Flows
Mislav Balunović
Anian Ruoss
Martin Vechev
AAML
66
38
0
10 Jun 2021
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Iván Vallés-Pérez
Julian Roth
Grzegorz Beringer
Roberto Barra-Chicote
J. Droppo
102
8
0
10 Jun 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
208
1,150
0
08 Jun 2021
Speech BERT Embedding For Improving Prosody in Neural TTS
Liping Chen
Yan Deng
Xi Wang
Frank Soong
Lei He
89
23
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
112
12
0
08 Jun 2021
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
90
14
0
08 Jun 2021
Tabular Data: Deep Learning is Not All You Need
Ravid Shwartz-Ziv
Amitai Armon
LMTD
205
1,304
0
06 Jun 2021
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Dong Min
Dong Bok Lee
Eunho Yang
Sung Ju Hwang
136
175
0
06 Jun 2021
The Image Local Autoregressive Transformer
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
82
13
0
04 Jun 2021
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Ji-Hoon Kim
Sang-Hoon Lee
Ji-Hyun Lee
Seong-Whan Lee
106
54
0
04 Jun 2021
An Improved Model for Voicing Silent Speech
David Gaddy
Dana Klein
72
34
0
03 Jun 2021
Drivers' Manoeuvre Modelling and Prediction for Safe HRI
Erwin Jose López Pulgarín
G. Herrmann
U. Leonards
31
0
0
03 Jun 2021
NVC-Net: End-to-End Adversarial Voice Conversion
Bac Nguyen Cong
Fabien Cardinaux
AAML
128
42
0
02 Jun 2021
Deep Personalized Glucose Level Forecasting Using Attention-based Recurrent Neural Networks
Mohammadreza Armandpour
Brian Kidd
Yu Du
Jianhua Z. Huang
81
15
0
02 Jun 2021
Previous
1
2
3
...
29
30
31
...
60
61
62
Next