ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio
v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Generative Modeling of Regular and Irregular Time Series Data via
  Koopman VAEs
Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs
Ilan Naiman
N. Benjamin Erichson
Pu Ren
Lbnl Michael W. Mahoney ICSI
Omri Azencot
AI4TS
92
26
0
04 Oct 2023
SEA: Sparse Linear Attention with Estimated Attention Mask
SEA: Sparse Linear Attention with Estimated Attention Mask
Heejun Lee
Jina Kim
Jeffrey Willette
Sung Ju Hwang
162
7
0
03 Oct 2023
DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform
  Generation
DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
Roi Benita
Michael Elad
Joseph Keshet
DiffM
115
8
0
02 Oct 2023
A Comprehensive Review of Generative AI in Healthcare
A Comprehensive Review of Generative AI in Healthcare
Yasin Shokrollahi
Sahar Yarmohammadtoosky
Matthew M. Nikahd
Pengfei Dong
Xianqi Li
Linxia Gu
MedImAI4CE
91
20
0
01 Oct 2023
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Dongchao Yang
Jinchuan Tian
Xuejiao Tan
Rongjie Huang
Songxiang Liu
...
Jiang Bian
Xixin Wu
Zhou Zhao
Shinji Watanabe
Helen M. Meng
CVBMAuLLM
135
128
0
01 Oct 2023
AI ensemble for signal detection of higher order gravitational wave
  modes of quasi-circular, spinning, non-precessing binary black hole mergers
AI ensemble for signal detection of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers
Minyang Tian
Eliu A. Huerta
Huihuo Zheng
53
0
0
29 Sep 2023
MotionLM: Multi-Agent Motion Forecasting as Language Modeling
MotionLM: Multi-Agent Motion Forecasting as Language Modeling
Ari Seff
Brian Cera
Dian Chen
Mason Ng
Aurick Zhou
Nigamaa Nayakanti
Khaled S. Refaat
Rami Al-Rfou
Benjamin Sapp
78
103
0
28 Sep 2023
A Unified View of Differentially Private Deep Generative Modeling
A Unified View of Differentially Private Deep Generative Modeling
Dingfan Chen
Raouf Kerkouche
Mario Fritz
SyDa
90
5
0
27 Sep 2023
High-Fidelity Speech Synthesis with Minimal Supervision: All Using
  Diffusion Models
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
Chunyu Qiang
Hao Li
Yixin Tian
Yi Zhao
Ying Zhang
Longbiao Wang
Jianwu Dang
DiffM
107
2
0
27 Sep 2023
Privacy-preserving and Privacy-attacking Approaches for Speech and Audio
  -- A Survey
Privacy-preserving and Privacy-attacking Approaches for Speech and Audio -- A Survey
Yuchen Liu
Apu Kapadia
Donald Williamson
AAML
76
0
0
26 Sep 2023
Deep Generative Methods for Producing Forecast Trajectories in Power
  Systems
Deep Generative Methods for Producing Forecast Trajectories in Power Systems
Nathan Weill
Jonathan Dumas
AI4TS
64
0
0
26 Sep 2023
Optimization Techniques for a Physical Model of Human Vocalisation
Optimization Techniques for a Physical Model of Human Vocalisation
Mateo Cámara
Zhiyuan Xu
Yi-Chen Zong
José-Luis Blanco
Joshua D. Reiss
29
3
0
26 Sep 2023
Audio classification with Dilated Convolution with Learnable Spacings
Audio classification with Dilated Convolution with Learnable Spacings
Ismail Khalfaoui-Hassani
T. Masquelier
Thomas Pellegrini
71
1
0
25 Sep 2023
DurIAN-E: Duration Informed Attention Network For Expressive
  Text-to-Speech Synthesis
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis
Yu Gu
Yianrao Bian
Guangzhi Lei
Chao Weng
Jane Polak Scowcroft
DiffM
55
2
0
22 Sep 2023
CrossSinger: A Cross-Lingual Multi-Singer High-Fidelity Singing Voice
  Synthesizer Trained on Monolingual Singers
CrossSinger: A Cross-Lingual Multi-Singer High-Fidelity Singing Voice Synthesizer Trained on Monolingual Singers
Xintong Wang
Chang Zeng
Jun Chen
Chunhui Wang
71
6
0
22 Sep 2023
Performance Conditioning for Diffusion-Based Multi-Instrument Music
  Synthesis
Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis
Ben Maman
Johannes Zeitler
Meinard Muller
Amit H. Bermano
DiffM
59
4
0
21 Sep 2023
The Impact of Silence on Speech Anti-Spoofing
The Impact of Silence on Speech Anti-Spoofing
Yuxiang Zhang
Zhuo Li
Jingze Lu
Hua Hua
Wenchao Wang
Pengyuan Zhang
80
21
0
21 Sep 2023
SpeechAlign: a Framework for Speech Translation Alignment Evaluation
SpeechAlign: a Framework for Speech Translation Alignment Evaluation
Belen Alastruey
Aleix Sant
Gerard I. Gállego
David Dale
Marta R. Costa-jussá
AuLLM
56
3
0
20 Sep 2023
Speak While You Think: Streaming Speech Synthesis During Text Generation
Speak While You Think: Streaming Speech Synthesis During Text Generation
Avihu Dekel
Slava Shechtman
Raul Fernandez
David Haws
Zvi Kons
R. Hoory
64
9
0
20 Sep 2023
Towards Generative Modeling of Urban Flow through Knowledge-enhanced
  Denoising Diffusion
Towards Generative Modeling of Urban Flow through Knowledge-enhanced Denoising Diffusion
Zhilun Zhou
Jingtao Ding
Yu Liu
Depeng Jin
Yong Li
DiffMAI4CE
95
23
0
19 Sep 2023
Speech Synthesis By Unrolling Diffusion Process using Neural Network Layers
Speech Synthesis By Unrolling Diffusion Process using Neural Network Layers
Peter Ochieng
DiffM
56
0
0
18 Sep 2023
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by
  Natural Language Prompts
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Jixun Yao
Yuguang Yang
Yinjiao Lei
Ziqian Ning
Yanni Hu
Yu Pan
Jingjing Yin
Hongbin Zhou
Heng Lu
Linfu Xie
DiffM
115
23
0
17 Sep 2023
Test-Time Compensated Representation Learning for Extreme Traffic
  Forecasting
Test-Time Compensated Representation Learning for Extreme Traffic Forecasting
Zhiwei Zhang
Weizhong Zhang
Yaowei Huang
Kani Chen
AI4TS
25
1
0
16 Sep 2023
Fewer-token Neural Speech Codec with Time-invariant Codes
Fewer-token Neural Speech Codec with Time-invariant Codes
Yong Ren
Tao Wang
Jiangyan Yi
Le Xu
Jianhua Tao
Chuyuan Zhang
Jun Zhou
85
36
0
15 Sep 2023
MASTERKEY: Practical Backdoor Attack Against Speaker Verification
  Systems
MASTERKEY: Practical Backdoor Attack Against Speaker Verification Systems
Hanqing Guo
Xun Chen
Junfeng Guo
Li Xiao
Qiben Yan
84
13
0
13 Sep 2023
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
51
7
0
12 Sep 2023
AudRandAug: Random Image Augmentations for Audio Classification
AudRandAug: Random Image Augmentations for Audio Classification
Teerath Kumar
Muhammad Turab
Alessandra Mileo
Malika Bendechache
Takfarinas Saber
68
7
0
09 Sep 2023
A Two-Stage Training Framework for Joint Speech Compression and
  Enhancement
A Two-Stage Training Framework for Joint Speech Compression and Enhancement
Jiayi Huang
Zeyu Yan
Wenbin Jiang
Fei Wen
61
1
0
08 Sep 2023
Large-Scale Automatic Audiobook Creation
Large-Scale Automatic Audiobook Creation
Brendan Walsh
Mark Hamilton
Greg Newby
Xi Wang
Serena Ruan
...
Lei He
Shaofei Zhang
Eric Dettinger
William T. Freeman
Markus Weimer
68
1
0
07 Sep 2023
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial
  Network
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Takashi Shibuya
Yuhta Takida
Yuki Mitsufuji
71
11
0
06 Sep 2023
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in
  Music Audio Signals
Self-Supervised Disentanglement of Harmonic and Rhythmic Features in Music Audio Signals
Yiming Wu
CoGeDRL
113
0
0
06 Sep 2023
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge
  2023
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Zhihang Xu
Shaofei Zhang
Xi Wang
Jiajun Zhang
Wenning Wei
Lei He
Sheng Zhao
81
2
0
06 Sep 2023
Object Size-Driven Design of Convolutional Neural Networks: Virtual Axle
  Detection based on Raw Data
Object Size-Driven Design of Convolutional Neural Networks: Virtual Axle Detection based on Raw Data
Henik Riedel
Robert Steven Lorenzen
Clemens Hubler
79
1
0
04 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
97
3
0
03 Sep 2023
Advances in machine-learning-based sampling motivated by lattice quantum
  chromodynamics
Advances in machine-learning-based sampling motivated by lattice quantum chromodynamics
Kyle Cranmer
G. Kanwar
S. Racanière
Danilo Jimenez Rezende
P. Shanahan
AI4CE
96
26
0
03 Sep 2023
Timbre-reserved Adversarial Attack in Speaker Identification
Timbre-reserved Adversarial Attack in Speaker Identification
Qing Wang
Jixun Yao
Li Zhang
Pengcheng Guo
Linfu Xie
AAML
79
4
0
02 Sep 2023
The FruitShell French synthesis system at the Blizzard 2023 Challenge
The FruitShell French synthesis system at the Blizzard 2023 Challenge
Xin Qi
Xiaopeng Wang
Zhiyong Wang
Wang Liu
Mingming Ding
Shuchen Shi
25
1
0
01 Sep 2023
Ten Years of Generative Adversarial Nets (GANs): A survey of the
  state-of-the-art
Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art
Tanujit Chakraborty
Ujjwal Reddy K S
Shraddha M. Naik
Madhurima Panja
B. Manvitha
112
74
0
30 Aug 2023
RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation
RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation
Mel Vecerík
Carl Doersch
Yi Yang
Todor Davchev
Y. Aytar
Guangyao Zhou
R. Hadsell
Lourdes Agapito
Jonathan Scholz
128
55
0
30 Aug 2023
MASA-TCN: Multi-anchor Space-aware Temporal Convolutional Neural
  Networks for Continuous and Discrete EEG Emotion Recognition
MASA-TCN: Multi-anchor Space-aware Temporal Convolutional Neural Networks for Continuous and Discrete EEG Emotion Recognition
Yi Ding
Su Zhang
Chuangao Tang
Cuntai Guan
64
12
0
30 Aug 2023
A Review of Differentiable Digital Signal Processing for Music & Speech
  Synthesis
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
83
25
0
29 Aug 2023
MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly
  Detection
MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly Detection
Yixuan Zhou
Xing Xu
Jingkuan Song
Fumin Shen
Hengtao Shen
AI4CE
116
22
0
29 Aug 2023
Audio Deepfake Detection: A Survey
Audio Deepfake Detection: A Survey
Jiangyan Yi
Chenglong Wang
J. Tao
Xiaohui Zhang
Chu Yuan Zhang
Yan Zhao
125
52
0
29 Aug 2023
Comparing AutoML and Deep Learning Methods for Condition Monitoring
  using Realistic Validation Scenarios
Comparing AutoML and Deep Learning Methods for Condition Monitoring using Realistic Validation Scenarios
P. Goodarzi
A. Schütze
T. Schneider
44
1
0
28 Aug 2023
Meta Attentive Graph Convolutional Recurrent Network for Traffic
  Forecasting
Meta Attentive Graph Convolutional Recurrent Network for Traffic Forecasting
Adnan Zeb
Yongchao Ye
Shiyao Zhang
James Jianqiao Yu
AI4TS
88
0
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
278
31
0
27 Aug 2023
Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code
  Diffusion using Transformers
Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers
Abril Corona-Figueroa
Sam Bond-Taylor
Neelanjan Bhowmik
Yona Falinie A. Gaus
T. Breckon
Hubert P. H. Shum
Chris G. Willcocks
DiffM
87
4
0
27 Aug 2023
A Comprehensive Survey for Evaluation Methodologies of AI-Generated
  Music
A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music
Zeyu Xiong
Weitao Wang
Jing Yu
Yue Lin
Ziyan Wang
MGen
83
7
0
26 Aug 2023
Business Metric-Aware Forecasting for Inventory Management
Business Metric-Aware Forecasting for Inventory Management
Helen Zhou
Sercan O. Arik
Jingtao Wang
AI4TS
62
4
0
24 Aug 2023
Unified Data Management and Comprehensive Performance Evaluation for
  Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark]
Unified Data Management and Comprehensive Performance Evaluation for Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark]
Jiawei Jiang
Chengkai Han
Wayne Xin Zhao
Jingyuan Wang
92
2
0
24 Aug 2023
Previous
123...101112...606162
Next