ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXivPDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,039 papers shown
Title
Spatio-Temporal Momentum: Jointly Learning Time-Series and
  Cross-Sectional Strategies
Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies
Wee Ling Tan
Stephen J. Roberts
S. Zohren
AI4TS
AIFin
27
10
0
20 Feb 2023
Because Every Sensor Is Unique, so Is Every Pair: Handling Dynamicity in
  Traffic Forecasting
Because Every Sensor Is Unique, so Is Every Pair: Handling Dynamicity in Traffic Forecasting
Arian Prabowo
Wei Shao
Hao Xue
Piotr Koniusz
Flora D. Salim
AI4TS
44
14
0
20 Feb 2023
Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts
Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts
Chengzhe Sun
Shan Jia
Shuwei Hou
Ehab AlBadawy
Siwei Lyu
135
3
0
18 Feb 2023
DTAAD: Dual Tcn-Attention Networks for Anomaly Detection in Multivariate
  Time Series Data
DTAAD: Dual Tcn-Attention Networks for Anomaly Detection in Multivariate Time Series Data
Ling Yu
AI4TS
30
27
0
17 Feb 2023
Continuous-time convolutions model of event sequences
Continuous-time convolutions model of event sequences
Vladislav Zhuzhel
Vsevolod Grabar
Galina Boeva
Artem Zabolotnyi
Alexander Stepikin
...
Mikhail Orlov
Ivan Kireev
Evgeny Burnaev
Rodrigo Rivera-Castro
Alexey Zaytsev
AI4TS
8
0
0
13 Feb 2023
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech
  synthesis in Indian languages
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages
Sudhanshu Srivastava
Ishika Gupta
Anusha Prakash
Jom Kuriakose
H. Murthy
VLM
21
1
0
13 Feb 2023
Vector Quantized Wasserstein Auto-Encoder
Vector Quantized Wasserstein Auto-Encoder
Tung-Long Vuong
Trung Le
He Zhao
Chuanxia Zheng
Mehrtash Harandi
Jianfei Cai
Dinh Q. Phung
DRL
47
17
0
12 Feb 2023
SLOTH: Structured Learning and Task-based Optimization for Time Series
  Forecasting on Hierarchies
SLOTH: Structured Learning and Task-based Optimization for Time Series Forecasting on Hierarchies
Fang-le Zhou
Chenle Pan
Lintao Ma
Yu Liu
Shiyu Wang
...
Xu Hu
Yun Hu
Yang Zheng
Lei Lei
Yun Hu
29
3
0
11 Feb 2023
Pruning Deep Neural Networks from a Sparsity Perspective
Pruning Deep Neural Networks from a Sparsity Perspective
Enmao Diao
G. Wang
Jiawei Zhan
Yuhong Yang
Jie Ding
Vahid Tarokh
32
30
0
11 Feb 2023
DNArch: Learning Convolutional Neural Architectures by Backpropagation
DNArch: Learning Convolutional Neural Architectures by Backpropagation
David W. Romero
Neil Zeghidour
AI4CE
27
4
0
10 Feb 2023
Hypernetworks build Implicit Neural Representations of Sounds
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemtslaw Spurek
Jacek Tabor
Tomasz Trzciñski
29
11
0
09 Feb 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
Peng Fei Zhu
Chao Pang
Yekun Chai
Lei Li
Shuohuan Wang
Yu Sun
Hao Tian
Hua Wu
DiffM
19
20
0
09 Feb 2023
Short-Term Memory Convolutions
Short-Term Memory Convolutions
Grzegorz Stefański
Krzysztof Arendt
P. Daniluk
Bartlomiej Jasik
Artur Szumaczuk
23
4
0
08 Feb 2023
Machine Learning for Synthetic Data Generation: A Review
Machine Learning for Synthetic Data Generation: A Review
Ying-Cheng Lu
Minjie Shen
Huazheng Wang
Xiao Wang
Capucine Van Rechem
Tianfan Fu
Wenqi Wei
SyDa
49
140
0
08 Feb 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and
  Separation
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
32
38
0
04 Feb 2023
Multivariate Time Series Anomaly Detection via Dynamic Graph Forecasting
Multivariate Time Series Anomaly Detection via Dynamic Graph Forecasting
Katrina Chen
M. Feng
T. Wirjanto
AI4TS
13
6
0
04 Feb 2023
Time Series Forecasting via Semi-Asymmetric Convolutional Architecture
  with Global Atrous Sliding Window
Time Series Forecasting via Semi-Asymmetric Convolutional Architecture with Global Atrous Sliding Window
Yuanpeng He
AI4TS
43
0
0
31 Jan 2023
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising
  Diffusion Models
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion Models
Haomin Wen
Youfang Lin
Yutong Xia
Huaiyu Wan
Qingsong Wen
Roger Zimmermann
Keli Zhang
DiffM
31
83
0
31 Jan 2023
ArchiSound: Audio Generation with Diffusion
ArchiSound: Audio Generation with Diffusion
Flavio Schneider
21
23
0
30 Jan 2023
Deep networks for system identification: a Survey
Deep networks for system identification: a Survey
G. Pillonetto
Aleksandr Aravkin
Daniel Gedon
L. Ljung
Antônio H. Ribeiro
Thomas B. Schon
OOD
42
37
0
30 Jan 2023
SingSong: Generating musical accompaniments from singing
SingSong: Generating musical accompaniments from singing
Chris Donahue
Antoine Caillon
Adam Roberts
Ethan Manilow
P. Esling
...
Mauro Verzetti
Ian Simon
Olivier Pietquin
Neil Zeghidour
Jesse Engel
40
52
0
30 Jan 2023
Do We Really Need Graph Neural Networks for Traffic Forecasting?
Do We Really Need Graph Neural Networks for Traffic Forecasting?
Xu Liu
Keli Zhang
Chao Huang
Hengchang Hu
Yushi Cao
Bryan Hooi
Roger Zimmermann
AI4TS
28
20
0
30 Jan 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu
Zehua Chen
Yiitan Yuan
Xinhao Mei
Xubo Liu
Danilo Mandic
Wenwu Wang
Mark D. Plumbley
DiffM
49
473
0
29 Jan 2023
On granularity of prosodic representations in expressive text-to-speech
On granularity of prosodic representations in expressive text-to-speech
Mikolaj Babianski
Kamil Pokora
Raahil Shah
Rafał Sienkiewicz
Daniel Korzekwa
V. Klimkov
37
5
0
26 Jan 2023
MusicLM: Generating Music From Text
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
55
418
0
26 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
55
2
0
26 Jan 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving
  Source Separation
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
38
14
0
25 Jan 2023
Modelling Long Range Dependencies in $N$D: From Task-Specific to a
  General Purpose CNN
Modelling Long Range Dependencies in NNND: From Task-Specific to a General Purpose CNN
David M. Knigge
David W. Romero
Albert Gu
E. Gavves
Erik J. Bekkers
Jakub M. Tomczak
Mark Hoogendoorn
Jan-Jakob Sonke
3DV
40
21
0
25 Jan 2023
FInC Flow: Fast and Invertible $k \times k$ Convolutions for Normalizing
  Flows
FInC Flow: Fast and Invertible k×kk \times kk×k Convolutions for Normalizing Flows
Aditya Kallappa
Sandeep Nagar
Girish Varma
27
2
0
23 Jan 2023
Dance2MIDI: Dance-driven multi-instruments music generation
Dance2MIDI: Dance-driven multi-instruments music generation
Bo Han
Yuheng Li
Yixuan Shen
Yi Ren
Feilin Han
26
5
0
22 Jan 2023
Regeneration Learning: A Learning Paradigm for Data Generation
Regeneration Learning: A Learning Paradigm for Data Generation
Xu Tan
Tao Qin
Jiang Bian
Tie-Yan Liu
Yoshua Bengio
GAN
40
15
0
21 Jan 2023
Novel-View Acoustic Synthesis
Novel-View Acoustic Synthesis
Changan Chen
Alexander Richard
Roman Shapovalov
V. Ithapu
Natalia Neverova
Kristen Grauman
Andrea Vedaldi
32
33
0
20 Jan 2023
An investigation of the reconstruction capacity of stacked convolutional
  autoencoders for log-mel-spectrograms
An investigation of the reconstruction capacity of stacked convolutional autoencoders for log-mel-spectrograms
Anastasia Natsiou
Luca Longo
Seán O'Leary
16
0
0
18 Jan 2023
A Transformer-based Diffusion Probabilistic Model for Heart Rate and
  Blood Pressure Forecasting in Intensive Care Unit
A Transformer-based Diffusion Probabilistic Model for Heart Rate and Blood Pressure Forecasting in Intensive Care Unit
Ping Chang
Huayu Li
S. Quan
Shuyang Lu
Shu-Fen Wung
Janet Roveda
Ao Li
DiffM
19
17
0
16 Jan 2023
CNN-Based Action Recognition and Pose Estimation for Classifying Animal
  Behavior from Videos: A Survey
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey
Michael Perez
Corey Toler-Franklin
MedIm
38
14
0
15 Jan 2023
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Simbarashe Nyatsanga
Taras Kucherenko
Chaitanya Ahuja
G. Henter
Michael Neff
SLR
46
90
0
13 Jan 2023
Spectral Cross-Domain Neural Network with Soft-adaptive Threshold
  Spectral Enhancement
Spectral Cross-Domain Neural Network with Soft-adaptive Threshold Spectral Enhancement
Che Liu
Sibo Cheng
Weiping Ding
Rossella Arcucci
42
9
0
10 Jan 2023
Introducing Model Inversion Attacks on Automatic Speaker Recognition
Introducing Model Inversion Attacks on Automatic Speaker Recognition
Karla Pizzi
Franziska Boenisch
U. Sahin
Konstantin Böttinger
33
3
0
09 Jan 2023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
48
654
0
05 Jan 2023
Autonomous Drone Racing: A Survey
Autonomous Drone Racing: A Survey
D. Hanover
Antonio Loquercio
L. Bauersfeld
Angel Romero
Robert Pěnička
Yunlong Song
Giovanni Cioffi
Elia Kaufmann
Davide Scaramuzza
67
58
0
04 Jan 2023
An end-to-end multi-scale network for action prediction in videos
An end-to-end multi-scale network for action prediction in videos
Xiaofan Liu
Jianqin Yin
Yuanxi Sun
Zhicheng Zhang
Jin Tang
27
0
0
31 Dec 2022
Blind Restoration of Real-World Audio by 1D Operational GANs
Blind Restoration of Real-World Audio by 1D Operational GANs
T. Ince
S. Kiranyaz
Ozer Can Devecioglu
Muhammad Salman Khan
Muhammad Chowdhury
Moncef Gabbouj
27
4
0
30 Dec 2022
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to
  Speech
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Ze Chen
Yihan Wu
Yichong Leng
Jiawei Chen
Haohe Liu
...
Ke Wang
Lei He
Sheng Zhao
Jiang Bian
Danilo Mandic
DiffM
37
22
0
30 Dec 2022
Voice conversion with limited data and limitless data augmentations
Voice conversion with limited data and limitless data augmentations
Olga Slizovskaia
Jordi Janer
Pritish Chandna
Oscar Mayor
35
1
0
27 Dec 2022
Neural Shape Compiler: A Unified Framework for Transforming between
  Text, Point Cloud, and Program
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
44
5
0
25 Dec 2022
Deep Latent State Space Models for Time-Series Generation
Deep Latent State Space Models for Time-Series Generation
Linqi Zhou
Michael Poli
Winnie Xu
Stefano Massaroli
Stefano Ermon
BDL
AI4TS
14
34
0
24 Dec 2022
A Mathematical Framework for Learning Probability Distributions
A Mathematical Framework for Learning Probability Distributions
Hongkang Yang
43
7
0
22 Dec 2022
End-to-End Automatic Speech Recognition model for the Sudanese Dialect
End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Ayman Mansour
Wafaa F. Mukhtar
27
1
0
21 Dec 2022
Dissecting Transformer Length Extrapolation via the Lens of Receptive
  Field Analysis
Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
Peter J. Ramadge
36
40
0
20 Dec 2022
Contextually Enhanced ES-dRNN with Dynamic Attention for Short-Term Load
  Forecasting
Contextually Enhanced ES-dRNN with Dynamic Attention for Short-Term Load Forecasting
Slawek Smyl
Grzegorz Dudek
Paweł Pełka
AI4TS
39
14
0
18 Dec 2022
Previous
123...141516...596061
Next