ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXivPDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,039 papers shown
Title
Leveraging Wastewater Monitoring for COVID-19 Forecasting in the US: a
  Deep Learning study
Leveraging Wastewater Monitoring for COVID-19 Forecasting in the US: a Deep Learning study
Mehrdad Fazli
Heman Shakeri
29
1
0
17 Dec 2022
Convolution-enhanced Evolving Attention Networks
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
32
6
0
16 Dec 2022
Text-to-speech synthesis based on latent variable conversion using
  diffusion probabilistic model and variational autoencoder
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Yusuke Yasuda
Tomoki Toda
DiffM
22
7
0
16 Dec 2022
RWEN-TTS: Relation-aware Word Encoding Network for Natural
  Text-to-Speech Synthesis
RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis
Shinhyeok Oh
HyeongRae Noh
Yoonseok Hong
Insoo Oh
25
0
0
15 Dec 2022
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement
  Learning
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning
M. Ibrahim
Ammar Fayad
30
1
0
14 Dec 2022
Image Compression with Product Quantized Masked Image Modeling
Image Compression with Product Quantized Masked Image Modeling
Alaaeldin El-Nouby
Matthew Muckley
Karen Ullrich
Ivan Laptev
Jakob Verbeek
Hervé Jégou
MQ
32
31
0
14 Dec 2022
Fully complex-valued deep learning model for visual perception
Fully complex-valued deep learning model for visual perception
Aniruddh Sikdar
Sumanth Udupa
Suresh Sundaram
30
2
0
14 Dec 2022
Improving Accuracy Without Losing Interpretability: A ML Approach for
  Time Series Forecasting
Improving Accuracy Without Losing Interpretability: A ML Approach for Time Series Forecasting
Yiqi Sun
Zheng Shi
Jianshen Zhang
Yongzhi Qi
Hao Hu
Zuo-jun Shen
AI4TS
18
0
0
13 Dec 2022
MegaCRN: Meta-Graph Convolutional Recurrent Network for Spatio-Temporal Modeling
Renhe Jiang
Zhaonan Wang
Jiawei Yong
P. Jeph
Quanjun Chen
Yasumasa Kobayashi
Xuan Song
Toyotaro Suzumura
Shintaro Fukushima
GNN
BDL
AI4TS
39
4
0
12 Dec 2022
MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis
  Dataset
MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset
Kailin Liang
Bin Liu
Yifan Hu
Rui Liu
F. Bao
Guanglai Gao
41
1
0
11 Dec 2022
FAIR AI Models in High Energy Physics
FAIR AI Models in High Energy Physics
Javier Mauricio Duarte
Haoyang Li
Avik Roy
Ruike Zhu
Eliu A. Huerta
...
Mark S. Neubauer
Sang Eon Park
M. Quinnan
R. Rusack
Zhizhen Zhao
41
8
0
09 Dec 2022
Learning Options via Compression
Learning Options via Compression
Yiding Jiang
E. Liu
Benjamin Eysenbach
Zico Kolter
Chelsea Finn
OffRL
30
13
0
08 Dec 2022
Knowledge Distillation Applied to Optical Channel Equalization: Solving
  the Parallelization Problem of Recurrent Connection
Knowledge Distillation Applied to Optical Channel Equalization: Solving the Parallelization Problem of Recurrent Connection
S. Srivallapanondh
Pedro J. Freire
B. Spinnler
N. Costa
A. Napoli
S. Turitsyn
Jaroslaw E. Prilepsky
14
12
0
08 Dec 2022
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with
  Very Low Computational Complexity
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Ahmed Mustafa
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
30
4
0
08 Dec 2022
GreenEyes: An Air Quality Evaluating Model based on WaveNet
GreenEyes: An Air Quality Evaluating Model based on WaveNet
Kan Huang
Kai Zhang
Ming-de Liu
17
2
0
08 Dec 2022
Short term prediction of demand for ride hailing services: A deep
  learning approach
Short term prediction of demand for ride hailing services: A deep learning approach
Long Chen
Piyushimita
P. Thakuriah
K. Ampountolas
AI4TS
33
21
0
07 Dec 2022
Criteria for Classifying Forecasting Methods
Criteria for Classifying Forecasting Methods
Tim Januschowski
Jan Gasthaus
Bernie Wang
David Salinas
Valentin Flunkert
Michael Bohlke-Schneider
Laurent Callot
AI4TS
26
173
0
07 Dec 2022
Deep conv-attention model for diagnosing left bundle branch block from 12-lead electrocardiograms
Ali Sadeghi
A. Rezaee
F. Hajati
22
1
0
07 Dec 2022
A K-variate Time Series Is Worth K Words: Evolution of the Vanilla
  Transformer Architecture for Long-term Multivariate Time Series Forecasting
A K-variate Time Series Is Worth K Words: Evolution of the Vanilla Transformer Architecture for Long-term Multivariate Time Series Forecasting
Zanwei Zhou
Rui-Ming Zhong
Chen Yang
Yan Wang
Xiaokang Yang
Wei Shen
AI4TS
53
9
0
06 Dec 2022
Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection
Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection
Christophe Ley
Jorge F. Silva
OOD
CML
22
1
0
05 Dec 2022
Evince the artifacts of Spoof Speech by blending Vocal Tract and Voice
  Source Features
Evince the artifacts of Spoof Speech by blending Vocal Tract and Voice Source Features
T. U. K. Reddy
Sahukari Chaitanya Varun
Kota Pranav Kumar Sankala Sreekanth
K. Murty
23
0
0
05 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
40
21
0
01 Dec 2022
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data
  Format
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
Qi Zhu
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Baolin Peng
...
Dazhen Wan
Xiaochen Zhu
Jianfeng Gao
Milica Gavsić
Minlie Huang
56
23
0
30 Nov 2022
Extreme Audio Time Stretching Using Neural Synthesis
Extreme Audio Time Stretching Using Neural Synthesis
Leonardo Fierro
Alec Wright
Vesa Valimaki
Matti Hämäläinen
22
1
0
30 Nov 2022
SNAC: Speaker-normalized affine coupling layer in flow-based
  architecture for zero-shot multi-speaker text-to-speech
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
Byoung Jin Choi
Myeonghun Jeong
Joun Yeop Lee
N. Kim
28
13
0
30 Nov 2022
AirFormer: Predicting Nationwide Air Quality in China with Transformers
AirFormer: Predicting Nationwide Air Quality in China with Transformers
Keli Zhang
Yutong Xia
Songyu Ke
Yiwei Wang
Qingsong Wen
Junbo Zhang
Yu Zheng
Roger Zimmermann
AI4TS
AI4CE
24
107
0
29 Nov 2022
Lipschitz constant estimation for 1D convolutional neural networks
Lipschitz constant estimation for 1D convolutional neural networks
Patricia Pauli
Dennis Gramlich
Frank Allgöwer
31
13
0
28 Nov 2022
Traditional Classification Neural Networks are Good Generators: They are
  Competitive with DDPMs and GANs
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs
Guangrun Wang
Philip Torr
33
8
0
27 Nov 2022
Spatio-Temporal Meta-Graph Learning for Traffic Forecasting
Spatio-Temporal Meta-Graph Learning for Traffic Forecasting
Renhe Jiang
Zhaonan Wang
Jiawei Yong
P. Jeph
Quanjun Chen
Yasumasa Kobayashi
Xuan Song
Shintaro Fukushima
Toyotaro Suzumura
AI4TS
32
181
0
27 Nov 2022
A Maximum Log-Likelihood Method for Imbalanced Few-Shot Learning Tasks
A Maximum Log-Likelihood Method for Imbalanced Few-Shot Learning Tasks
Samuel Hess
G. Ditzler
64
2
0
26 Nov 2022
Deep Fake Detection, Deterrence and Response: Challenges and
  Opportunities
Deep Fake Detection, Deterrence and Response: Challenges and Opportunities
Amin Azmoodeh
Ali Dehghantanha
45
2
0
26 Nov 2022
A System for Morphology-Task Generalization via Unified Representation
  and Behavior Distillation
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
33
16
0
25 Nov 2022
Puffin: pitch-synchronous neural waveform generation for fullband speech
  on modest devices
Puffin: pitch-synchronous neural waveform generation for fullband speech on modest devices
O. Watts
Lovisa Wihlborg
Cassia Valentini-Botinhao
38
3
0
25 Nov 2022
Efficient Incremental Text-to-Speech on GPUs
Efficient Incremental Text-to-Speech on GPUs
Muyang Du
Chuan Liu
Jiaxing Qi
Junjie Lai
26
1
0
25 Nov 2022
Generative Modeling in Sinogram Domain for Sparse-view CT Reconstruction
Generative Modeling in Sinogram Domain for Sparse-view CT Reconstruction
Bing Guan
Cailian Yang
Liu Zhang
S. Niu
Minghui Zhang
Yuhao Wang
Weiwen Wu
Qiegen Liu
DiffM
MedIm
38
41
0
25 Nov 2022
HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with
  Discrete and Continuous Denoising
HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising
M. Shabani
Sepidehsadat Hosseini
Yasutaka Furukawa
DiffM
33
58
0
23 Nov 2022
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural
  Speech Synthesis System
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System
Takenori Yoshimura
Shinji Takaki
Kazuhiro Nakamura
Keiichiro Oura
Yukiya Hono
Kei Hashimoto
Yoshihiko Nankaku
K. Tokuda
37
7
0
21 Nov 2022
VarietySound: Timbre-Controllable Video to Sound Generation via
  Unsupervised Information Disentanglement
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Chenye Cui
Yi Ren
Jinglin Liu
Rongjie Huang
Zhou Zhao
VGen
40
14
0
19 Nov 2022
Step Counting with Attention-based LSTM
Step Counting with Attention-based LSTM
Shehroz S. Khan
Ali Abedi
HAI
24
8
0
18 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion
  Models
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
32
166
0
17 Nov 2022
Towards Building Text-To-Speech Systems for the Next Billion Users
Towards Building Text-To-Speech Systems for the Next Billion Users
Gokul Karthik Kumar
V. PraveenS.
Pratyush Kumar
Mitesh M. Khapra
Karthik Nandakumar
43
18
0
17 Nov 2022
A Review of Intelligent Music Generation Systems
A Review of Intelligent Music Generation Systems
Lei Wang
Ziyi Zhao
Han Liu
Junwei Pang
Yi-qiang Qin
Qidi Wu
MGen
26
31
0
16 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion
  Model
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
43
187
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
34
18
0
15 Nov 2022
Diffusion Models for Medical Image Analysis: A Comprehensive Survey
Diffusion Models for Medical Image Analysis: A Comprehensive Survey
A. Kazerouni
Ehsan Khodapanah Aghdam
Moein Heidari
Reza Azad
Mohsen Fayyaz
Ilker Hacihaliloglu
Dorit Merhof
DiffM
MedIm
56
359
0
14 Nov 2022
Autovocoder: Fast Waveform Generation from a Learned Speech
  Representation using Differentiable Digital Signal Processing
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
J. Webber
Cassia Valentini-Botinhao
Evelyn Williams
G. Henter
Simon King
16
9
0
13 Nov 2022
HigeNet: A Highly Efficient Modeling for Long Sequence Time Series
  Prediction in AIOps
HigeNet: A Highly Efficient Modeling for Long Sequence Time Series Prediction in AIOps
Jiajia Li
Feng Tan
Cheng He
Zikai Wang
Haitao Song
Lingfei Wu
Pengwei Hu
28
0
0
13 Nov 2022
Online Phase Reconstruction via DNN-based Phase Differences Estimation
Online Phase Reconstruction via DNN-based Phase Differences Estimation
Yoshiki Masuyama
Kohei Yatabe
Kento Nagatomo
Yasuhiro Oikawa
3DV
21
7
0
12 Nov 2022
GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant
  Instance Conditioning
GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant Instance Conditioning
Gaku Narita
Junichi Shimizu
Taketo Akama
GAN
34
11
0
10 Nov 2022
Accented Text-to-Speech Synthesis with a Conditional Variational
  Autoencoder
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
J. Melechovský
Ambuj Mehrish
Berrak Sisman
Dorien Herremans
30
6
0
07 Nov 2022
Previous
123...151617...596061
Next