1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Image Compression with Product Quantized Masked Image Modeling
Alaaeldin El-Nouby
Matthew Muckley
Karen Ullrich
Ivan Laptev
Jakob Verbeek
Hervé Jégou
MQ
82
31
0
14 Dec 2022
Fully complex-valued deep learning model for visual perception
Aniruddh Sikdar
Sumanth Udupa
Suresh Sundaram
90
4
0
14 Dec 2022
Improving Accuracy Without Losing Interpretability: A ML Approach for Time Series Forecasting
Yiqi Sun
Zheng Shi
Jianshen Zhang
Yongzhi Qi
Hao Hu
Zuo-jun Shen
AI4TS
73
1
0
13 Dec 2022
MegaCRN: Meta-Graph Convolutional Recurrent Network for Spatio-Temporal Modeling
Renhe Jiang
Zhaonan Wang
Jiawei Yong
P. Jeph
Quanjun Chen
Yasumasa Kobayashi
Xuan Song
Toyotaro Suzumura
Shintaro Fukushima
GNN
BDL
AI4TS
111
4
0
12 Dec 2022
MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset
Kailin Liang
Bin Liu
Yifan Hu
Rui Liu
F. Bao
Guanglai Gao
76
1
0
11 Dec 2022
FAIR AI Models in High Energy Physics
Javier Mauricio Duarte
Haoyang Li
Avik Roy
Ruike Zhu
Eliu A. Huerta
...
Mark S. Neubauer
Sang Eon Park
M. Quinnan
R. Rusack
Zhizhen Zhao
123
9
0
09 Dec 2022
Learning Options via Compression
Yiding Jiang
Emmy Liu
Benjamin Eysenbach
Zico Kolter
Chelsea Finn
OffRL
95
14
0
08 Dec 2022
Knowledge Distillation Applied to Optical Channel Equalization: Solving the Parallelization Problem of Recurrent Connection
S. Srivallapanondh
Pedro J. Freire
B. Spinnler
N. Costa
A. Napoli
S. Turitsyn
Jaroslaw E. Prilepsky
60
12
0
08 Dec 2022
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Ahmed Mustafa
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
54
4
0
08 Dec 2022
GreenEyes: An Air Quality Evaluating Model based on WaveNet
Kan Huang
Kai Zhang
Ming-de Liu
26
2
0
08 Dec 2022
Short term prediction of demand for ride hailing services: A deep learning approach
Long Chen
Piyushimita Thakuriah
K. Ampountolas
AI4TS
59
22
0
07 Dec 2022
Criteria for Classifying Forecasting Methods
Tim Januschowski
Jan Gasthaus
Bernie Wang
David Salinas
Valentin Flunkert
Michael Bohlke-Schneider
Laurent Callot
AI4TS
90
179
0
07 Dec 2022
Deep conv-attention model for diagnosing left bundle branch block from 12-lead electrocardiograms
Ali Sadeghi
A. Rezaee
F. Hajati
26
1
0
07 Dec 2022
A K-variate Time Series Is Worth K Words: Evolution of the Vanilla Transformer Architecture for Long-term Multivariate Time Series Forecasting
Zanwei Zhou
Rui-Ming Zhong
Chen Yang
Yan Wang
Xiaokang Yang
Wei Shen
AI4TS
68
9
0
06 Dec 2022
Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection
Christophe Ley
Jorge F. Silva
OOD
CML
35
1
0
05 Dec 2022
Evince the artifacts of Spoof Speech by blending Vocal Tract and Voice Source Features
T. U. K. Reddy
Sahukari Chaitanya Varun
Kota Pranav Kumar
Sankala Sreekanth
K. Murty
117
0
0
05 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
119
23
0
01 Dec 2022
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
Qi Zhu
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Baolin Peng
...
Dazhen Wan
Xiaochen Zhu
Jianfeng Gao
Milica Gašić
Minlie Huang
108
23
0
30 Nov 2022
Extreme Audio Time Stretching Using Neural Synthesis
Leonardo Fierro
Alec Wright
Vesa Välimäki
Matti Hämäläinen
57
1
0
30 Nov 2022
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
Byoung Jin Choi
Myeonghun Jeong
Joun Yeop Lee
N. Kim
104
13
0
30 Nov 2022
AirFormer: Predicting Nationwide Air Quality in China with Transformers
Yuxuan Liang
Yutong Xia
Songyu Ke
Yiwei Wang
Qingsong Wen
Junbo Zhang
Yu Zheng
Roger Zimmermann
AI4TS
AI4CE
73
119
0
29 Nov 2022
Lipschitz constant estimation for 1D convolutional neural networks
Patricia Pauli
Dennis Gramlich
Frank Allgöwer
53
13
0
28 Nov 2022
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs
Guangrun Wang
Philip Torr
83
9
0
27 Nov 2022
Spatio-Temporal Meta-Graph Learning for Traffic Forecasting
Renhe Jiang
Zhaonan Wang
Jiawei Yong
P. Jeph
Quanjun Chen
Yasumasa Kobayashi
Xuan Song
Shintaro Fukushima
Toyotaro Suzumura
AI4TS
88
202
0
27 Nov 2022
A Maximum Log-Likelihood Method for Imbalanced Few-Shot Learning Tasks
Samuel Hess
G. Ditzler
147
2
0
26 Nov 2022
Deep Fake Detection, Deterrence and Response: Challenges and Opportunities
Amin Azmoodeh
Ali Dehghantanha
83
3
0
26 Nov 2022
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
83
17
0
25 Nov 2022
Puffin: pitch-synchronous neural waveform generation for fullband speech on modest devices
O. Watts
Lovisa Wihlborg
Cassia Valentini-Botinhao
73
3
0
25 Nov 2022
Efficient Incremental Text-to-Speech on GPUs
Muyang Du
Chuan Liu
Jiaxing Qi
Junjie Lai
52
1
0
25 Nov 2022
Generative Modeling in Sinogram Domain for Sparse-view CT Reconstruction
Bing Guan
Cailian Yang
Liu Zhang
S. Niu
Minghui Zhang
Yuhao Wang
Weiwen Wu
Qiegen Liu
DiffM
MedIm
92
44
0
25 Nov 2022
HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising
M. Shabani
Sepidehsadat Hosseini
Yasutaka Furukawa
DiffM
85
66
0
23 Nov 2022
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System
Takenori Yoshimura
Shinji Takaki
Kazuhiro Nakamura
Keiichiro Oura
Yukiya Hono
Kei Hashimoto
Yoshihiko Nankaku
K. Tokuda
69
7
0
21 Nov 2022
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Chenye Cui
Yi Ren
Jinglin Liu
Rongjie Huang
Zhou Zhao
VGen
88
14
0
19 Nov 2022
Step Counting with Attention-based LSTM
Shehroz S. Khan
Ali Abedi
HAI
36
8
0
18 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
81
174
0
17 Nov 2022
Towards Building Text-To-Speech Systems for the Next Billion Users
Gokul Karthik Kumar
V. PraveenS.
Pratyush Kumar
Mitesh M. Khapra
Karthik Nandakumar
92
22
0
17 Nov 2022
A Review of Intelligent Music Generation Systems
Lei Wang
Ziyi Zhao
Han Liu
Junwei Pang
Yi-qiang Qin
Qidi Wu
MGen
77
35
0
16 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
186
198
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktäschel
Edward Grefenstette
LRM
83
20
0
15 Nov 2022
Diffusion Models for Medical Image Analysis: A Comprehensive Survey
Amirhossein Kazerouni
Ehsan Khodapanah Aghdam
Moein Heidari
Reza Azad
Mohsen Fayyaz
Ilker Hacihaliloglu
Dorit Merhof
DiffM
MedIm
137
399
0
14 Nov 2022
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
J. Webber
Cassia Valentini-Botinhao
Evelyn Williams
G. Henter
Simon King
111
9
0
13 Nov 2022
HigeNet: A Highly Efficient Modeling for Long Sequence Time Series Prediction in AIOps
Jiajia Li
Feng Tan
Cheng He
Zikai Wang
Haitao Song
Lingfei Wu
Pengwei Hu
60
0
0
13 Nov 2022
Online Phase Reconstruction via DNN-based Phase Differences Estimation
Yoshiki Masuyama
Kohei Yatabe
Kento Nagatomo
Yasuhiro Oikawa
3DV
86
8
0
12 Nov 2022
GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant Instance Conditioning
Gaku Narita
Junichi Shimizu
Taketo Akama
GAN
82
11
0
10 Nov 2022
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
J. Melechovský
Ambuj Mehrish
Berrak Sisman
Dorien Herremans
83
6
0
07 Nov 2022
I Hear Your True Colors: Image Guided Audio Generation
Roy Sheffer
Yossi Adi
VLM
85
76
0
06 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis
Bryce Irvin
Marko Stamenovic
M. Kegler
Li-Chia Yang
95
21
0
04 Nov 2022
Cold Diffusion for Speech Enhancement
Hao Yen
François Germain
Gordon Wichern
Jonathan Le Roux
DiffM
96
45
0
04 Nov 2022
Real-Time Target Sound Extraction
Bandhav Veluri
Justin Chan
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
114
33
0
04 Nov 2022
Translated Skip Connections -- Expanding the Receptive Fields of Fully Convolutional Neural Networks
Joshua Bruton
Hairong Wang
SSeg
31
4
0
03 Nov 2022