WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
A Daily Tourism Demand Prediction Framework Based on Multi-head Attention CNN: The Case of The Foreign Entrant in South Korea
Dong-Keon Kim
S. K. Shyn
Donghee Kim
Seungwoo Jang
Kwangsu Kim
AI4TS
33
7
0
01 Dec 2021
Improving Deep Learning Interpretability by Saliency Guided Training
Aya Abdelsalam Ismail
H. C. Bravo
Soheil Feizi
FAtt
105
83
0
29 Nov 2021
How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey
Zahra Khanjani
Gabrielle Watson
V. P. Janeja
61
27
0
28 Nov 2021
When Creators Meet the Metaverse: A Survey on Computational Arts
Lik-Hang Lee
Zijun Lin
Rui Hu
Zhengya Gong
Abhishek Kumar
Tangyao Li
Sijia Li
Pan Hui
80
88
0
26 Nov 2021
A-Muze-Net: Music Generation by Composing the Harmony based on the Generated Melody
Orly Goren
Eliya Nachmani
Lior Wolf
72
4
0
25 Nov 2021
Spatio-Temporal Joint Graph Convolutional Networks for Traffic Forecasting
Chuanpan Zheng
Xiaoliang Fan
Shirui Pan
Haibing Jin
Zhaopeng Peng
Zonghan Wu
Cheng-Yu Wang
Philip S. Yu
GNN, AI4TS
64
52
0
25 Nov 2021
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
Jiashuo Yu
Ying Cheng
Ruiwei Zhao
Rui Feng
Yuejie Zhang
103
62
0
24 Nov 2021
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Heeseung Kim
Sungwon Kim
Sungroh Yoon
DiffM, BDL
131
112
0
23 Nov 2021
Neural Fields in Visual Computing and Beyond
Yiheng Xie
Towaki Takikawa
Shunsuke Saito
Or Litany
Shiqin Yan
Numair Khan
Federico Tombari
James Tompkin
Vincent Sitzmann
Srinath Sridhar
3DH
281
638
0
22 Nov 2021
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Zheng-Hua Tan
John H. L. Hansen
Jesper Jensen
87
107
0
20 Nov 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Michael Hassid
Michelle Tadmor Ramanovich
Brendan Shillingford
Miaosen Wang
Ye Jia
Tal Remez
DiffM
72
18
0
19 Nov 2021
Differentiable Wavetable Synthesis
Siyuan Shan
Lamtharn Hantrakul
Jitong Chen
Matt Avent
David Trevelyan
115
20
0
19 Nov 2021
A transformer-based model for default prediction in mid-cap corporate markets
Kamesh Korangi
Christophe Mues
Cristián Bravo
AI4TS
37
31
0
18 Nov 2021
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Nikolaos Ellinas
G. Vamvoukakis
K. Markopoulos
Aimilios Chalamandaris
Georgia Maniati
Panos Kakoulidis
S. Raptis
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
139
37
0
17 Nov 2021
Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach
Maximilian Würfel
Qiwei Han
Maximilian Kaiser
AI4TS
45
8
0
16 Nov 2021
Towards Generating Real-World Time Series Data
Hengzhi Pei
Kan Ren
Yuqing Yang
Chang-Shu Liu
Tao Qin
Dongsheng Li
AI4TS
112
35
0
16 Nov 2021
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di
Jiang
Sihan Liu
Zhaokai Wang
Leyan Zhu
Zexin He
Hongming Liu
Shuicheng Yan
102
95
0
16 Nov 2021
Property Inference Attacks Against GANs
Junhao Zhou
Yufei Chen
Chao Shen
Yang Zhang
AAML, MIACV
112
55
0
15 Nov 2021
Skillful Twelve Hour Precipitation Forecasts using Large Context Neural Networks
L. Espeholt
Shreya Agrawal
C. Sønderby
M. Kumar
Jonathan Heek
Carla Bromberg
Cenk Gazen
Jason Hickey
Aaron Bell
Nal Kalchbrenner
AI4Cl
76
48
0
14 Nov 2021
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Damien Ronssin
Milos Cernak
78
11
0
12 Nov 2021
Learning Signal-Agnostic Manifolds of Neural Fields
Yilun Du
Katherine M. Collins
J. Tenenbaum
Vincent Sitzmann
MedIm
90
50
0
11 Nov 2021
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders
Aaron Valero Puche
Sukhan Lee
53
1
0
09 Nov 2021
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis
Antoine Caillon
P. Esling
DRL
68
112
0
09 Nov 2021
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
79
0
0
09 Nov 2021
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points
Yu-Chen Lin
Cheng Yu
Y. Hsu
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
33
6
0
08 Nov 2021
Speaker Generation
Daisy Stanton
Matt Shannon
Soroosh Mariooryad
RJ Skerry-Ryan
Eric Battenberg
Tom Bagby
David Kao
96
30
0
07 Nov 2021
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech
Sung-Feng Huang
Chyi-Jiunn Lin
Da-Rong Liu
Yi-Chen Chen
Hung-yi Lee
128
57
0
07 Nov 2021
Emotional Prosody Control for Speech Generation
S. Sivaprasad
Saiteja Kosgi
Vineet Gandhi
63
17
0
07 Nov 2021
Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Pengfei Zhang
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jianru Xue
Nanning Zheng
82
6
0
07 Nov 2021
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data
Priya Shukla
V. Kushwaha
G. C. Nandi
42
4
0
06 Nov 2021
A Bayesian generative neural network framework for epidemic inference problems
I. Biazzo
Alfredo Braunstein
Luca Dall'Asta
Fabio Mazza
121
16
0
05 Nov 2021
Generating Diverse Realistic Laughter for Interactive Art
Mehdi Afsar
Eric Park
Étienne Paquette
Gauthier Gidel
Kory W. Mathewson
Eilif B. Muller
54
7
0
04 Nov 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schönherr
DiffM
204
131
0
04 Nov 2021
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Benjamin van Niekerk
M. Carbonneau
Julian Zaïdi
Matthew Baas
Hugo Seuté
Herman Kamper
DRL
119
123
0
03 Nov 2021
Weight, Block or Unit? Exploring Sparsity Tradeoffs for Speech Enhancement on Tiny Neural Accelerators
Marko Stamenovic
Nils L. Westhausen
Li-Chia Yang
Carl R. Jensen
Alex Pawlicki
68
11
0
03 Nov 2021
Trajectory Prediction with Graph-based Dual-scale Context Fusion
Lu Zhang
Peiliang Li
Jing Chen
Shaojie Shen
101
31
0
02 Nov 2021
WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting
Philipp Weidel
Sadique Sheik
45
15
0
02 Nov 2021
Don't Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence
Tianshi Cao
Alex Bie
Arash Vahdat
Sanja Fidler
Karsten Kreis
SyDa, DiffM
89
72
0
01 Nov 2021
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Shengyuan Xu
Wenxiao Zhao
Jing Guo
66
12
0
01 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
308
1,861
0
31 Oct 2021
QDCNN: Quantum Dilated Convolutional Neural Network
Yixiong Chen
76
4
0
29 Oct 2021
Ask "Who", Not "What": Bitcoin Volatility Forecasting with Twitter Data
M. E. Akbiyik
Mert Erkul
Killian Kaempf
V. Vasiliauskaite
Nino Antulov-Fantulin
OOD
51
9
0
27 Oct 2021
Assessing Evaluation Metrics for Speech-to-Speech Translation
Elizabeth Salesky
Julian Mäder
Severin Klinger
74
15
0
26 Oct 2021
Probabilistic Hierarchical Forecasting with Deep Poisson Mixtures
Kin G. Olivares
N. Meetei
Ruijun Ma
Rohan Reddy
Mengfei Cao
Lee Dicker
AI4TS
109
26
0
25 Oct 2021
Neural Flows: Efficient Alternative to Neural ODEs
Marin Biloš
Johanna Sommer
Syama Sundar Rangapuram
Tim Januschowski
Stephan Günnemann
AI4TS
80
77
0
25 Oct 2021
Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience
Weiyi Lu
Meng-Hsuan Wu
Yuh-ming Chiu
Li Su
60
0
0
25 Oct 2021
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021
Yanqing Liu
Rui Shao
G. Wang
Kuan Chen
Bohan Li
Pong C. Yuen
Jinzhu Li
Lei He
Sheng Zhao
91
55
0
25 Oct 2021
ProtoShotXAI: Using Prototypical Few-Shot Architecture for Explainable AI
Samuel Hess
G. Ditzler
AAML
89
1
0
22 Oct 2021
Merging Two Cultures: Deep and Statistical Learning
A. Bhadra
J. Datta
Nicholas G. Polson
Vadim Sokolov
Jianeng Xu
BDL
90
10
0
22 Oct 2021
A Data-Centric Optimization Framework for Machine Learning
Oliver Rausch
Tal Ben-Nun
Nikoli Dryden
Andrei Ivanov
Shigang Li
Torsten Hoefler
AI4CE
59
16
0
20 Oct 2021