ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Neural Percussive Synthesis Parameterised by High-Level Timbral Features
António Ramires
Pritish Chandna
Xavier Favory
Emilia Gómez
Xavier Serra
81
23
0
25 Nov 2019
Natural Image Manipulation for Autoregressive Models Using Fisher Scores
Wilson Yan
Jonathan Ho
Pieter Abbeel
28
0
0
25 Nov 2019
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations
Taihong Xiao
Yi-Hsuan Tsai
Kihyuk Sohn
Manmohan Chandraker
Ming-Hsuan Yang
76
75
0
22 Nov 2019
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks
Yong Wang
Longyue Wang
Shuming Shi
Victor O.K. Li
Zhaopeng Tu
62
25
0
22 Nov 2019
Adversarial Robustness of Flow-Based Generative Models
Phillip E. Pope
Yogesh Balaji
Soheil Feizi
AAML
48
20
0
20 Nov 2019
Deep-Learning Estimation of Band Gap with the Reading-Periodic-Table Method and Periodic Convolution Layer
Tomohiko Konno
71
1
0
16 Nov 2019
Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning
K. Yuksel
Jann Goschenhofer
H. V. Varma
U. Fietzek
Franz MJ Pfister
OOD
30
0
0
15 Nov 2019
Deep Long Audio Inpainting
Ya-Liang Chang
Kuan-Ying Lee
Po-Yu Wu
Hung-yi Lee
Winston H. Hsu
68
33
0
15 Nov 2019
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling
Ruizhe Zhao
Brian K. Vogel
Tanvir Ahmed
Wayne Luk
61
37
0
14 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM, VLM, KELM
105
656
0
13 Nov 2019
Rate-Regularization and Generalization in VAEs
Alican Bozkurt
Babak Esmaeili
Jean-Baptiste Tristan
Dana H. Brooks
Jennifer G. Dy
Jan-Willem van de Meent
DRL
92
8
0
11 Nov 2019
GMAN: A Graph Multi-Attention Network for Traffic Prediction
Chuanpan Zheng
Xiaoliang Fan
Cheng-Yu Wang
Jianzhong Qi
AI4TS, AI4CE
154
1,406
0
11 Nov 2019
Generative Autoregressive Networks for 3D Dancing Move Synthesis from Music
Hyemin Ahn
Jaehun Kim
Kihyun Kim
Songhwai Oh
GAN
81
44
0
11 Nov 2019
Feedback Recurrent AutoEncoder
Yang Yang
Guillaume Sautière
J. Jon Ryu
Taco S. Cohen
106
21
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI, AI4TS
122
338
0
10 Nov 2019
Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model
Seyyed Saeed Sarfjoo
Xin Wang
G. Henter
Jaime Lorenzo-Trueba
Shinji Takaki
Junichi Yamagishi
45
8
0
10 Nov 2019
Characterizing dynamically varying acoustic scenes from egocentric audio recordings in workplace setting
Arindam Jati
Amrutha Nadarajan
Karel Mundnich
Shrikanth Narayanan
34
2
0
10 Nov 2019
XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification
E. Rahimian
Soheil Zabihi
S. F. Atashzar
A. Asif
Arash Mohammadi
71
45
0
09 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier
Andreas Loukas
Martin Jaggi
179
535
0
08 Nov 2019
Teacher-Student Training for Robust Tacotron-based TTS
Rui Liu
Berrak Sisman
Jingdong Li
F. Bao
Guanglai Gao
Haizhou Li
109
38
0
07 Nov 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Mingbo Ma
Baigong Zheng
Kaibo Liu
Renjie Zheng
Hairong Liu
Kainan Peng
Kenneth Church
Liang Huang
66
31
0
07 Nov 2019
Deep Hedging: Learning to Simulate Equity Option Markets
Magnus Wiese
Lianjun Bai
Ben Wood
Hans Buehler
GAN
90
69
0
05 Nov 2019
Emotional speech synthesis with rich and granularized control
Seyun Um
Sangshin Oh
Kyungguen Byun
Inseon Jang
C. Ahn
Hong-Goo Kang
85
90
0
05 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
99
12
0
05 Nov 2019
The frontier of simulation-based inference
Kyle Cranmer
Johann Brehmer
Gilles Louppe
AI4CE
277
859
0
04 Nov 2019
Deep-Gap: A deep learning framework for forecasting crowdsourcing supply-demand gap based on imaging time series and residual learning
Ahmed Ben Said
A. Erradi
AI4TS
40
7
0
02 Nov 2019
Deep convolutional neural networks for multi-scale time-series classification and application to disruption prediction in fusion devices
R. Churchill
the DIII-D team
AI4CE
36
10
0
31 Oct 2019
Neural Density Estimation and Likelihood-free Inference
George Papamakarios
BDL, DRL
100
47
0
29 Oct 2019
Disentangling Timbre and Singing Style with Multi-singer Singing Synthesis System
Juheon Lee
Hyeong-Seok Choi
Junghyun Koo
Kyogu Lee
45
18
0
29 Oct 2019
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis
Mingrui Yuan
Z. Duan
23
1
0
29 Oct 2019
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
28
2
0
28 Oct 2019
Transferring neural speech waveform synthesizers to musical instrument sounds generation
Yi Zhao
Xin Wang
Lauri Juvela
Junichi Yamagishi
92
17
0
27 Oct 2019
Implicit Posterior Variational Inference for Deep Gaussian Processes
Haibin Yu
Yizhou Chen
Zhongxiang Dai
K. H. Low
Patrick Jaillet
88
43
0
26 Oct 2019
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency
M. Whitehill
Shuang Ma
Daniel J. McDuff
Yale Song
111
35
0
25 Oct 2019
Study of Deep Generative Models for Inorganic Chemical Compositions
Yoshihide Sawada
Koji Morikawa
Mikiya Fujii
GAN
69
13
0
25 Oct 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
195
821
0
25 Oct 2019
Hierarchical Representation Learning in Graph Neural Networks with Node Decimation Pooling
F. Bianchi
Daniele Grattarola
L. Livi
Cesare Alippi
199
49
0
24 Oct 2019
Towards Fine-Grained Prosody Control for Voice Conversion
Zheng Lian
Zhengqi Wen
70
19
0
24 Oct 2019
Vision-Infused Deep Audio Inpainting
Hang Zhou
Ziwei Liu
Lingfeng Guo
Ping Luo
Dahua Lin
142
88
0
24 Oct 2019
Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks
Kazuhiro Nakamura
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
84
19
0
24 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Tomoki Hayashi
Ryuichi Yamamoto
Katsuki Inoue
Takenori Yoshimura
Shinji Watanabe
Tomoki Toda
K. Takeda
Yu Zhang
Xu Tan
VLM
93
205
0
24 Oct 2019
Expression Analysis Based on Face Regions in Read-world Conditions
Zheng Lian
Ya Li
J. Tao
Jian Huang
Mingyue Niu
CVBM
47
58
0
23 Oct 2019
Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales
Sanjay Thakur
H. V. Hoof
Gunshi Gupta
David Meger
BDL
42
2
0
23 Oct 2019
Detecting Out-of-Distribution Inputs in Deep Neural Networks Using an Early-Layer Output
Vahdat Abdelzad
Krzysztof Czarnecki
Rick Salay
Taylor Denouden
Sachin Vernekar
Buu Phan
OODD
64
47
0
23 Oct 2019
Complex Transformer: A Framework for Modeling Complex-Valued Sequence
Muqiao Yang
Martin Q. Ma
Dongyu Li
Yao-Hung Hubert Tsai
Ruslan Salakhutdinov
ViT
53
38
0
22 Oct 2019
GANspection
Hammad A. Ayyubi
GAN
18
0
0
21 Oct 2019
You May Not Need Order in Time Series Forecasting
Yunkai Zhang
Qiao Jiang
Shurui Li
Xiaoyong Jin
Xueying Ma
Xifeng Yan
AI4TS
28
3
0
21 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
79
11
0
19 Oct 2019
Label-efficient audio classification through multitask learning and self-supervision
Tyler Lee
Ting Gong
Suchismita Padhy
Andrew Rouditchenko
A. Ndirango
SSL, VLM
62
7
0
19 Oct 2019
Decoupling feature propagation from the design of graph auto-encoders
P. Scherer
Helena Andrés-Terré
Pietro Lio
M. Jamnik
BDL
16
1
0
18 Oct 2019