ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Learning Gradient Fields for Shape Generation
Ruojin Cai
Guandao Yang
Hadar Averbuch-Elor
Jinwei Gu
Serge J. Belongie
Noah Snavely
B. Hariharan
3DPC
134
286
0
14 Aug 2020
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
81
8
0
13 Aug 2020
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
61
8
0
13 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
77
81
0
11 Aug 2020
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems
Ravichander Vipperla
Sangjun Park
Kihyun Choo
Samin S. Ishtiaq
Kyoungbo Min
S. Bhattacharya
Abhinav Mehrotra
Alberto Gil C. P. Ramos
Nicholas D. Lane
72
26
0
11 Aug 2020
DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Guillaume Alain
Maxime Chevalier-Boisvert
Frédéric Osterrath
Remi Piche-Taillefer
HAI
81
6
0
10 Aug 2020
Deep learning for photoacoustic imaging: a survey
Changchun Yang
Hengrong Lan
Feng Gao
Fei Gao
VLM, MedIm
59
21
0
10 Aug 2020
SpeedySpeech: Efficient Neural Speech Synthesis
Jan Vainer
Ondrej Dusek
66
43
0
09 Aug 2020
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions
D. Paul
Yannis Pantazis
Y. Stylianou
DRL
59
30
0
09 Aug 2020
Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Yeunju Choi
Youngmoon Jung
Hoirin Kim
139
26
0
09 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
141
329
0
09 Aug 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
121
40
0
07 Aug 2020
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment
Behnoosh Parsa
A. Banerjee
97
2
0
07 Aug 2020
DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System
Liqiang Zhang
Chengzhu Yu
Heng Lu
Chao Weng
Chunlei Zhang
Yusong Wu
Xiang Xie
Zijin Li
Dong Yu
68
34
0
07 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak
Lior Wolf
Yossi Adi
Yaniv Taigman
58
44
0
06 Aug 2020
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Xuan Li
Xun Huang
Jiahui Yu
Ting-Chun Wang
Arun Mallya
GAN
170
155
0
06 Aug 2020
HooliGAN: Robust, High Quality Neural Vocoding
Ollie McCarthy
Zo Ahmed
98
14
0
06 Aug 2020
Zero-Shot Multi-View Indoor Localization via Graph Location Networks
Meng-Jiun Chiou
Zhenguang Liu
Yifang Yin
Anan Liu
Roger Zimmermann
64
27
0
06 Aug 2020
PPSpeech: Phrase based Parallel End-to-End TTS System
Yahuan Cong
Ran Zhang
Jian Luan
47
3
0
06 Aug 2020
Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
Tamás Gábor Csapó
Csaba Zainkó
L. Tóth
G. Gosztolya
Alexandra Markó
61
33
0
06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
81
6
0
05 Aug 2020
Expressive TTS Training with Frame and Style Reconstruction Loss
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
112
73
0
04 Aug 2020
Neural Granular Sound Synthesis
Adrien Bitton
P. Esling
Tatsuya Harada
83
7
0
04 Aug 2020
A Spectral Energy Distance for Parallel Speech Synthesis
A. Gritsenko
Tim Salimans
Rianne van den Berg
Jasper Snoek
Nal Kalchbrenner
76
70
0
03 Aug 2020
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
Tomás Nekvinda
Ondrej Dusek
72
57
0
03 Aug 2020
Unacceptable, where is my privacy? Exploring Accidental Triggers of Smart Speakers
Lea Schonherr
Maximilian Golla
Thorsten Eisenhofer
Jan Wiele
D. Kolossa
Thorsten Holz
63
41
0
02 Aug 2020
Principles and Algorithms for Forecasting Groups of Time Series: Locality and Globality
Pablo Montero-Manso
Rob J. Hyndman
AI4TS
102
139
0
02 Aug 2020
Diet deep generative audio models with structured lottery
P. Esling
Ninon Devis
Adrien Bitton
Antoine Caillon
Axel Chemla-Romeu-Santos
Constance Douwes
105
6
0
31 Jul 2020
Traffic Control Gesture Recognition for Autonomous Vehicles
Julian Wiederer
Arij Bouazizi
U. Kressel
Vasileios Belagiannis
95
53
0
31 Jul 2020
An Empirical Survey of Data Augmentation for Time Series Classification with Neural Networks
Brian Kenji Iwana
S. Uchida
AI4TS
90
506
0
31 Jul 2020
Rewriting a Deep Generative Model
David Bau
Steven Liu
Tongzhou Wang
Jun-Yan Zhu
Antonio Torralba
GAN, DRL
108
140
0
30 Jul 2020
Anomaly Detection at Scale: The Case for Deep Distributional Time Series Models
Fadhel Ayed
Lorenzo Stella
Tim Januschowski
Jan Gasthaus
AI4TS
103
10
0
30 Jul 2020
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok Yang
Junmo Lee
Young-Ik Kim
Hoonyoung Cho
Injung Kim
85
73
0
30 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
130
58
0
29 Jul 2020
TrajGAIL: Generating Urban Vehicle Trajectories using Generative Adversarial Imitation Learning
Seongjin Choi
Jiwon Kim
H. Yeo
GAN
100
131
0
28 Jul 2020
Federated Self-Supervised Learning of Multi-Sensor Representations for Embedded Intelligence
Aaqib Saeed
Flora D. Salim
T. Ozcelebi
J. Lukkien
FedML, SSL
193
100
0
25 Jul 2020
Towards Game Design via Creative Machine Learning (GDCML)
Anurag Sarkar
Seth Cooper
AI4CE
72
21
0
25 Jul 2020
Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network
Ravi Shankar
Hsi-Wei Hsieh
N. Charon
A. Venkataraman
120
11
0
25 Jul 2020
Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator
Ravi Shankar
Jacob Sager
A. Venkataraman
GAN
117
20
0
25 Jul 2020
Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation
Xiaoyuan Yi
Hyeonseung Lee
Wenhao Li
Hyung Yong Kim
Nam Soo Kim
84
22
0
25 Jul 2020
Hierarchical Protein Function Prediction with Tail-GNNs
Stefan Spalević
Petar Veličković
Jovana Kovačević
Mladen Nikolic
AI4CE
66
5
0
24 Jul 2020
SeismoFlow -- Data augmentation for the class imbalance problem
R. Milidiú
Luis Müller
AI4TS
30
1
0
23 Jul 2020
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture
Fady K. Fahmy
M. Khalil
Hazem M. Abbas
55
21
0
22 Jul 2020
CrossTransformers: spatially-aware few-shot transfer
Carl Doersch
Ankush Gupta
Andrew Zisserman
ViT
294
338
0
22 Jul 2020
Rethinking CNN Models for Audio Classification
Kamalesh Palanisamy
Dipika Singhania
Angela Yao
SSL
83
146
0
22 Jul 2020
Foley Music: Learning to Generate Music from Videos
Chuang Gan
Deng Huang
Peihao Chen
J. Tenenbaum
Antonio Torralba
VGen
75
139
0
21 Jul 2020
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
S. Rehman
M. Waqas
Shanshan Tu
Anis Koubaa
O. Rehman
Jawad Ahmad
Muhammad Hanif
Zhu Han
49
6
0
21 Jul 2020
Complex Skill Acquisition Through Simple Skill Imitation Learning
Pranay Pasula
8
0
0
20 Jul 2020
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
Emma Rocheteau
Pietro Lio
Stephanie L. Hyland
OOD
64
60
0
18 Jul 2020
MTL2L: A Context Aware Neural Optimiser
N. Kuo
Mehrtash Harandi
Nicolas Fourrier
Christian J. Walder
Gabriela Ferraro
H. Suominen
36
0
0
18 Jul 2020