WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Towards Rapid and Robust Adversarial Training with One-Step Attacks
Leo Schwinn
René Raab
Björn Eskofier
AAML
79
6
0
24 Feb 2020
Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification
Wensi Tang
Guodong Long
Lu Liu
Dinesh Manocha
Michael Blumenstein
Jing Jiang
AI4TS
120
106
0
24 Feb 2020
On the Modularity of Hypernetworks
Tomer Galanti
Lior Wolf
65
5
0
23 Feb 2020
PolyGen: An Autoregressive Generative Model of 3D Meshes
C. Nash
Yaroslav Ganin
A. Eslami
Peter W. Battaglia
AI4CE
118
262
0
23 Feb 2020
Predictive Sampling with Forecasting Autoregressive Models
Auke Wiggers
Emiel Hoogeboom
BDL
75
16
0
23 Feb 2020
Transformer Hawkes Process
Simiao Zuo
Haoming Jiang
Zichong Li
T. Zhao
H. Zha
AI4TS
87
295
0
21 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
118
265
0
20 Feb 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL AI4TS
97
116
0
20 Feb 2020
A Neural Lip-Sync Framework for Synthesizing Photorealistic Virtual News Anchors
Ruobing Zheng
Zhou Zhu
Bo Song
Changjiang Ji
3DH
57
2
0
20 Feb 2020
An empirical study of Conv-TasNet
Berkan Kadıoğlu
Michael Horgan
Xiaoyu Liu
Jordi Pons
Dan Darcy
Vivek Kumar
42
44
0
20 Feb 2020
Neural Network Compression Framework for fast model inference
Alexander Kozlov
Ivan Lazarevich
Vasily Shamporov
N. Lyalyushkin
Yury Gorbachev
93
36
0
20 Feb 2020
Source Separation with Deep Generative Priors
V. Jayaram
John Thickstun
94
40
0
19 Feb 2020
Gravitational-wave parameter estimation with autoregressive neural network flows
Stephen R. Green
C. Simpson
J. Gair
BDL
130
86
0
18 Feb 2020
Conditional Mutual information-based Contrastive Loss for Financial Time Series Forecasting
Hanwei Wu
Ather Gattami
M. Flierl
AI4TS
72
14
0
18 Feb 2020
On the Discrepancy between Density Estimation and Sequence Generation
Jason D. Lee
Dustin Tran
Orhan Firat
Kyunghyun Cho
41
11
0
17 Feb 2020
Interactive Text-to-Speech System via Joint Style Analysis
Yang Gao
Weiyi Zheng
Zhaojun Yang
Thilo Köhler
Christian Fuegen
Qing He
78
11
0
17 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
88
13
0
15 Feb 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks
Shindong Lee
Bonggu Ko
Keonnyeong Lee
In-Chul Yoo
Dongsuk Yook
GAN
66
34
0
15 Feb 2020
Data-Driven Symbol Detection via Model-Based Machine Learning
Nariman Farsad
Nir Shlezinger
Andrea J. Goldsmith
Yonina C. Eldar
74
49
0
14 Feb 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
53
125
0
14 Feb 2020
Phase reconstruction based on recurrent phase unwrapping with deep neural networks
Yoshiki Masuyama
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
Noboru Harada
57
22
0
14 Feb 2020
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seungkwon Beack
Minje Kim
98
20
0
13 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
147
155
0
13 Feb 2020
The Conditional Entropy Bottleneck
Ian S. Fischer
OOD
125
122
0
13 Feb 2020
Explainable Deep Modeling of Tabular Data using TableGraphNet
G. Terejanu
Jawad Chowdhury
Rezaur Rashid
Asif J. Chowdhury
LMTD FAtt
16
3
0
12 Feb 2020
ForecastNet: A Time-Variant Deep Feed-Forward Neural Network Architecture for Multi-Step-Ahead Time-Series Forecasting
J. Dabrowski
Yifan Zhang
Ashfaqur Rahman
AI4TS
35
36
0
11 Feb 2020
FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA
Shehzeen Samarah Hussain
Mojan Javaheripi
Paarth Neekhara
Ryan Kastner
F. Koushanfar
58
21
0
09 Feb 2020
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning
Nan Jiang
Sheng Jin
Z. Duan
Changshui Zhang
OffRL
108
50
0
08 Feb 2020
Multimodal Controller for Generative Models
Enmao Diao
Jie Ding
Vahid Tarokh
77
3
0
07 Feb 2020
Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow
Didrik Nielsen
Ole Winther
MQ
235
13
0
06 Feb 2020
Attentional networks for music generation
G. Keerti
A. N. Vaishnavi
Prerana Mukherjee
A. Vidya
Gattineni Sai Sreenithya
Deeksha Nayab
MGen
36
24
0
06 Feb 2020
Prediction of head motion from speech waveforms with a canonical-correlation-constrained autoencoder
JinHong Lu
H. Shimodaira
44
8
0
05 Feb 2020
Continuous Melody Generation via Disentangled Short-Term Representations and Structural Conditions
Kai Chen
Gus Xia
Shlomo Dubnov
MGen
67
18
0
05 Feb 2020
Vocoder-free End-to-End Voice Conversion with Transformer Network
June-Woo Kim
H. Jung
Minho Lee
52
4
0
05 Feb 2020
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization
Henry B. Moss
Vatsal Aggarwal
N. Prateek
Javier I. González
Roberto Barra-Chicote
BDL
51
57
0
04 Feb 2020
Acoustic anomaly detection via latent regularized gaussian mixture generative adversarial networks
Chengwei Chen
Pan Chen
Lingyu Yang
Jinyuan Mo
Haichuan Song
Yuan Xie
Lizhuang Ma
22
1
0
04 Feb 2020
WeatherBench: A benchmark dataset for data-driven weather forecasting
S. Rasp
P. Dueben
S. Scher
Jonathan A. Weyn
Soukayna Mouatadid
Nils Thuerey
AI4Cl AI4TS
133
461
0
02 Feb 2020
Music2Dance: DanceNet for Music-driven Dance Generation
Wenlin Zhuang
Congyi Wang
Siyu Xia
Jinxiang Chai
Yangang Wang
DiffM
23
2
0
02 Feb 2020
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li
125
14
0
02 Feb 2020
Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Jingdong Li
Hui Zhang
Xueliang Zhang
Changliang Li
53
9
0
02 Feb 2020
Training Keyword Spotters with Limited and Synthesized Speech Data
James Lin
Kevin Kilgour
Dominik Roblek
Matthew Sharifi
63
58
0
31 Jan 2020
Real-Time Well Log Prediction From Drilling Data Using Deep Learning
Rayan Kanfar
Obai N. Shaikh
M. Yousefzadeh
T. Mukerji
13
36
0
28 Jan 2020
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
Maja Pantic
238
241
0
23 Jan 2020
RPN: A Residual Pooling Network for Efficient Federated Learning
Anbu Huang
Yuanyuan Chen
Yang Liu
Tianjian Chen
Qiang Yang
FedML
74
11
0
23 Jan 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
63
42
0
22 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
48
2
0
21 Jan 2020
Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Kyoung-Woon On
Eun-Sol Kim
Y. Heo
Byoung-Tak Zhang
BDL
52
6
0
17 Jan 2020
SqueezeWave: Extremely Lightweight Vocoders for On-device Speech Synthesis
Bohan Zhai
Tianren Gao
Flora Xue
D. Rothchild
Bichen Wu
Joseph E. Gonzalez
Kurt Keutzer
64
27
0
16 Jan 2020
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
188
381
0
14 Jan 2020
Unsupervised Audiovisual Synthesis via Exemplar Autoencoders
Kangle Deng
Aayush Bansal
Deva Ramanan
SSL VGen
74
12
0
13 Jan 2020