Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,039 papers shown
Title
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
37
119
0
29 May 2022
Deep Learning-based Spatially Explicit Emulation of an Agent-Based Simulator for Pandemic in a City
Varun Madhavan
Adway Mitra
P. Chakrabarti
AI4CE
20
0
0
28 May 2022
Group-level Brain Decoding with Deep Learning
Richard Csaky
M. Es
Oiwi Parker Jones
M. Woolrich
33
11
0
27 May 2022
Do we really need temporal convolutions in action segmentation?
Dazhao Du
Fuchun Sun
Yu Li
Zhongang Qi
Hui Xiong
Ying Shan
ViT
39
16
0
26 May 2022
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
27
14
0
24 May 2022
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
30
16
0
24 May 2022
HiPAL: A Deep Framework for Physician Burnout Prediction Using Activity Logs in Electronic Health Records
Hanyang Liu
Sunny S. Lou
Benjamin C. Warner
Derek Harford
Thomas Kannampallil
Chenyang Lu
LM&MA
HAI
37
9
0
24 May 2022
Deep Representations for Time-varying Brain Datasets
Sikun Lin
Shuyun Tang
Scott T. Grafton
Ambuj K. Singh
AI4CE
29
6
0
23 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
354
0
21 May 2022
Self-Supervised Time Series Representation Learning via Cross Reconstruction Transformer
Wen-Rang Zhang
Ling Yang
Shijia Geng
Shenda Hong
ViT
AI4TS
37
42
0
20 May 2022
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
Wonjune Kang
M. Hasegawa-Johnson
D. Roy
42
8
0
19 May 2022
Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification
Leo Schwinn
Leon Bungert
A. Nguyen
René Raab
Falk Pulsmeyer
Doina Precup
Björn Eskofier
Dario Zanca
OOD
56
14
0
19 May 2022
Cross-Enhancement Transformer for Action Segmentation
Jiahui Wang
Zhenyou Wang
Shanna Zhuang
Hui Wang
ViT
59
23
0
19 May 2022
Macedonian Speech Synthesis for Assistive Technology Applications
B. Sofronievski
Elena Velovska
Martin Velichkovski
Violeta Argirova
Tea Veljkovikj
...
Kristijan Lazarev
Toni Bachvarovski
Z. Ivanovski
Dimitar Tashkovski
B. Gerazov
16
0
0
18 May 2022
Spatial-Temporal Interactive Dynamic Graph Convolution Network for Traffic Forecasting
Aoyun Liu
Yaying Zhang
GNN
AI4TS
26
31
0
18 May 2022
HARNet: A Convolutional Neural Network for Realized Volatility Forecasting
Rafael Reisenhofer
Xandro Bayer
N. Hautsch
40
8
0
16 May 2022
cMelGAN: An Efficient Conditional Generative Model Based on Mel Spectrograms
Tracy Qian
Jackson Kaunismaa
Tony Chung
MGen
GAN
MedIm
21
5
0
15 May 2022
GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Anssi Kanervisto
Tomi Kinnunen
Ville Hautamaki
22
13
0
14 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
124
794
0
12 May 2022
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Reo Yoneyama
Yi-Chiao Wu
Tomoki Toda
30
14
0
12 May 2022
Robot Cooking with Stir-fry: Bimanual Non-prehensile Manipulation of Semi-fluid Objects
Junjia Liu
Yiting Chen
Zhipeng Dong
Shixiong Wang
Sylvain Calinon
Miao Li
Fei Chen
37
60
0
12 May 2022
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model
J. Valin
Ahmed Mustafa
Christopher Montgomery
Timothy B. Terriberry
Michael Klingbeil
Paris Smaragdis
A. Krishnaswamy
30
18
0
11 May 2022
Efficient Automated Deep Learning for Time Series Forecasting
Difan Deng
Florian Karl
Frank Hutter
Bernd Bischl
Marius Lindauer
AI4TS
59
16
0
11 May 2022
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE
Jiachen Lian
Chunlei Zhang
Gopala Krishna Anumanchipalli
Dong Yu
34
23
0
11 May 2022
Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis
Zhenzi Weng
Zhijin Qin
Xiaoming Tao
Chengkang Pan
Guangyi Liu
Geoffrey Ye Li
44
132
0
09 May 2022
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Xu Tan
Jiawei Chen
Haohe Liu
Jian Cong
Chen Zhang
...
Lei He
Frank Soong
Tao Qin
Sheng Zhao
Tie-Yan Liu
57
213
0
09 May 2022
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech
Yongqian Li
Cheng Yu
Guangzhi Sun
Hua Jiang
Fanglei Sun
Weiqin Zu
Ying Wen
Yang Yang
Jun Wang
34
7
0
09 May 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
51
109
0
06 May 2022
GANimator: Neural Motion Synthesis from a Single Sequence
Peizhuo Li
Kfir Aberman
Zihan Zhang
Rana Hanocka
O. Sorkine-Hornung
GAN
16
33
0
05 May 2022
MAD: Self-Supervised Masked Anomaly Detection Task for Multivariate Time Series
Yiwei Fu
Feng Xue
AI4TS
29
15
0
04 May 2022
SVTS: Scalable Video-to-Speech Synthesis
Rodrigo Mira
A. Haliassos
Stavros Petridis
Björn W. Schuller
Maja Pantic
22
32
0
04 May 2022
TartanDrive: A Large-Scale Dataset for Learning Off-Road Dynamics Models
S. Triest
Matthew Sivaprakasam
Sean J. Wang
Wenshan Wang
Aaron M. Johnson
Sebastian Scherer
35
55
0
03 May 2022
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
Alice Baird
Panagiotis Tzirakis
Gauthier Gidel
Marco Jiralerspong
Eilif B. Muller
Kory W. Mathewson
Björn Schuller
Min Zhang
D. Keltner
Alan S. Cowen
VLM
36
30
0
03 May 2022
HarmoF0: Logarithmic Scale Dilated Convolution For Pitch Estimation
Weixing Wei
P. Li
Yi Yu
Wei Li
25
17
0
02 May 2022
A Novel Speech-Driven Lip-Sync Model with CNN and LSTM
Xiaohong Li
Xiang Wang
Kai Wang
Kai Wang
21
4
0
02 May 2022
Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing Flows
M. Arpogaus
Marcus Voss
Beate Sick
Mark Nigge-Uricher
Oliver Durr
33
16
0
29 Apr 2022
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss
Efthymios Georgiou
Kosmas Kritsis
Georgios Paraskevopoulos
Athanasios Katsamanis
Vassilis Katsouros
Alexandros Potamianos
23
3
0
28 Apr 2022
Parallel Synthesis for Autoregressive Speech Generation
Po-Chun Hsu
Da-Rong Liu
Andy T. Liu
Hung-yi Lee
42
5
0
25 Apr 2022
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech
Zhenhui Ye
Zhou Zhao
Yi Ren
Fei Wu
46
27
0
25 Apr 2022
PhysioGAN: Training High Fidelity Generative Model for Physiological Sensor Readings
M. Alzantot
L. Garcia
Mani B. Srivastava
27
1
0
25 Apr 2022
Improving Self-Supervised Learning-based MOS Prediction Networks
Bálint Gyires-Tóth
Csaba Zainkó
SSL
16
1
0
23 Apr 2022
Sequence-Based Target Coin Prediction for Cryptocurrency Pump-and-Dump
Sihao Hu
Zhen Zhang
Shengliang Lu
Bingsheng He
Zhao Li
AI4TS
24
16
0
21 Apr 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
Yi Ren
Zhou Zhao
DiffM
28
166
0
21 Apr 2022
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency
Zhong-Qiu Wang
Gordon Wichern
Shinji Watanabe
Jonathan Le Roux
38
36
0
21 Apr 2022
Scale Dependencies and Self-Similar Models with Wavelet Scattering Spectra
Rudy Morel
G. Rochette
R. Leonarduzzi
J. Bouchaud
S. Mallat
26
14
0
19 Apr 2022
Approaching sales forecasting using recurrent neural networks and transformers
Iván Vallés-Pérez
E. Soria-Olivas
M. Martínez-Sober
Antonio J. Serrano
J. Gómez-Sanchís
Fernando Mateo
AI4TS
29
36
0
16 Apr 2022
Efficient Architecture Search for Diverse Tasks
Jun Shen
M. Khodak
Ameet Talwalkar
30
31
0
15 Apr 2022
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
Gabriel Loaiza-Ganem
Brendan Leigh Ross
Jesse C. Cresswell
Anthony L. Caterini
GAN
DRL
24
28
0
14 Apr 2022
Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Zhixi Cai
Kalin Stefanov
Abhinav Dhall
Munawar Hayat
27
3
0
13 Apr 2022
A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Zhe-ming Lu
Mengnan He
Ruixiong Zhang
Caixia Gong
GAN
19
2
0
12 Apr 2022
Previous
1
2
3
...
20
21
22
...
59
60
61
Next